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A missing energy discovery is possible at the LHC with the first 100 pb^^ of understood data. We 
present a realistic strategy to rapidly narrow the list of candidate theories at, or close to, the moment 
of discovery. The strategy is based on robust ratios of inclusive counts of simple physics objects. We 
study specific cases showing discrimination of look-alike models in simulated data sets that are at 
least 10 to 100 times smaller than used in previous studies. We discriminate supersymmetry models 
from non-supersymmetric look-alikes with only 100 pb~^ of simulated data, using combinations of 
observables that trace back to differences in spin. 
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I. INTRODUCTION 
A. Twenty questions at the LHC 

Many well-motivated theoretical frameworks make 
dramatic predictions for the experiments at the Large 
Hadron Collider. These frameworks are generally based 
upon assumptions about new symmetries, as is the 
case for supersymmetry (SUSY) Q and little Higgs 
(LH)[1, 0], or upon assumptions about new degrees of 
freedom such as extra large ^ or warped @ spatial di- 
mensions. Within each successful framework, one can 
construct a large number of qualitatively different mod- 
els consistent with all current data. Collectively these 
models populate the "theory space" of possible physics 
beyond the Standard Model. The BSM theory space is 
many dimensional, and the number of distinct models 
within it is formally infinite. Since the data will not 
provide a distinction between models that differ by suffi- 
ciently tiny or experimentally irrelevant details, infinity, 
in practice, becomes some large finite number N. The 
mapping of these N models into their experimental sig- 
natures at the LHC, though still incomplete, has been 
explored in great detail. 

As soon as discoveries are made at the LHC, physicists 
will face the LHC Inverse Problem: given a finite set 
of measurements with finite resolutions, how does one 
map back 01-0 to the underlying theory responsible for 
the new phenomena? So far, not enough progress has 
been made on this problem, especially as it relates to the 
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immediate follow-up of an early LHC discovery. 

When N is large, it is not a viable strategy to discrim- 
inate between N alternative explanations by performing 
N tests. However, as the game "twenty questions" illus- 
trates, a well-designed series of simple tests can identify 
the correct alternative in of order log(A^) steps, proceed- 
ing along a decision tree such that, at each branching, of 
order half of the remaining alternatives are eliminated. 

Addressing the LHC Inverse Problem implies designing 
and implementing this series of simple tests in the LHC 
experiments, so that with high confidence a significant 
fraction of the remaining theory space is ruled out at 
each step. The results of the first few tests will shape the 
requirements for future tests, so the immediate need is to 
develop the strategy for the early tests. In this paper we 
provide this strategy for the case of an LHC discovery in 
the inclusive missing energy signature. 



B. Missing energy at the LHC 

The existence of dark matter provides a powerful mo- 
tivation to explore missing energy signatures at the LHC, 
under the assumption that a significant fraction of dark 
matter may consist of weakly interacting thermal relics. 
Missing energy at the LHC is experimentally challenging. 
Most of the energy of 14 TeV pp collisions is carried off 
by undetected remnants of the underlying event, so miss- 
ing energy searches actually look for missing transverse 
energy {E^^^^) of the partonic subprocess. E^^^^ searches 
are plagued by instrumental and spurious backgrounds, 
including cosmic rays, scattering off beam halo and jet 
mismeasurement. Standard Model processes create an 
irreducible E^^^'^ background from processes such as the 
Z boson decay to neutrinos and tt production followed 
by semileptonic decays of the top. 

In many theoretical frameworks with dark matter can- 
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didates, there are heavy strongly interacting particles 
with the same conserved charge or parity that makes the 
dark matter particle stable. These colored particles will 
be pair-produced at the LHC with cross sections roughly 
in the range 0.1 to 100 pb. Their subsequent decays will 
produce Standard Model particles along with a pair of 
undetected dark matter particles. Thus the generic ex- 
perimental signature is both simple and inclusive: large 
E^^^^ accompanied by multiple energetic jets. A detailed 
strategy for early discovery with the inclusive Eip^^^ sig- 
nature was presented in the CMS Physics Technical De- 
sign Report [13] - (l^ and studied with full simulation of 
the CMS detector. After a series of cleanup and analysis 
cuts on a simulated E^^^^ trigger sample targeting the 
reduction of the instrumental and physics backgrounds, 
the signal efhciency remained as high as 25%. These re- 
sults indicate that, for signal cross sections as low as a 
few pb, an E^^^^ discovery could be made with the first 
100 pb^^ of understood LHC data^. In our study we as- 
sume as the starting point that a greater than 5(7 excess 
of events will be seen in a 14 TeV LHC data sample of 
100 pb~^ with an inclusive missing energy analysis. For 
invariability and comparability we effectively adopt the 
full analysis path and requirements used in [lO|. 



C. Look-alikes at the moment of discovery 

At the moment of discovery a large number of theory 
models will be immediately ruled out because, within 
conservative errors, they give the wrong excess. However 
a large number of models will remain as missing energy 
look-alikes, defined as models that predict the same in- 
clusive missing energy excess, within some tolerance, in 
the same analysis in the same detector, for a given in- 
tegrated luminosity. The immediate challenge is then to 
begin discriminating the look-alikes. 

The look-alike problem was studied in [9j as it might 
apply to a later mature phase of the LHC experiments. 
Even restricted to the slice of theory space populated by a 
partial scan of the MSSM, ignoring SM backgrounds and 
systematic errors, and applying an uncorrelated x^-like 
statistical analysis to 1808 correlated observables, this 
study found that a large number of look-alikes remained 
unresolved in a simulation equivalent to 10 fb~^. A more 
recent analysis [l3] attempts to resolve these look-alikes 
in a simulation of a future linear collider. 

At the moment of an early discovery the look-alike 
problem will be qualitatively different. The data sam- 
ples will be much smaller, with a limited palette of ro- 
bust reconstructed physics objects. For example, r or 



^ The first 100 pb~^ of understood LHC data will not be the first 
100 pb~^ of data written to tape. The 10 TeV data collected in 
the early running will be used for calibrations and understanding 
of benchmark Standard Model processes. 



b tagging in multijet final states will be in development 
during the 100 pb~^ era. In many small data samples 
peaks and edges in invariant mass distributions may not 
be visible, and most observables related to detailed fea- 
tures of the events will be rate limited. The observables 
that are available to discriminate the look-alikes in the 
very early running will be strongly correlated by physics 
and systematics making it imprudent to combine them 
in a multivariate analysis. 



D. Is it SUSY? 

By focusing on the discrimination of look-alikes, we are 
pursuing a strategy of simple binary choices: is Model A 
a significantly better explanation of the discovery data 
set than Model B? Each answer carries with it a few 
bits of important fundamental information about the new 
physics process responsible for missing energy. Obviously 
we will need to make many distinct look-alike compar- 
isons before we can hope to build up a clear picture from 
these individual bits. 

Consider how this strategy might play out for answer- 
ing the basic question "is it SUSY?" It may not be pos- 
sible to answer this question conclusively during the 100 
pb~^ era. Our strategy will consist of asking a series of 
more modest questions, some of them of the form: "does 
SUSY Model A give a significantly better explanation of 
the discovery data set than non-SUSY Model B?" None 
of these individual bits of information by itself is equiva- 
lent to answering "is it SUSY?" However we demonstrate 
that we can build up a picture from the data that con- 
nects back to features of the underlying theory. 

Furthermore, we demonstrate a concrete method to ob- 
tain indirect information about the spin of the new par- 
ticles. We establish how to discriminate between a non- 
SUSY model and its SUSY look-alikes. Even though we 
cannot measure the spins of the exotic particles directly, 
spin has significant effects on production cross sections, 
kinematic distributions and signal efficiencies. We are 
thus able to discriminate SUSY from non-SUSY using 
combinations of observables that trace back to differences 
in spin. Our study shows that in favorable cases this can 
be accomplished with data sets as small as 100 pb~^. 



E. Outline 

In section|lT]we review in detail the missing energy dis- 
covery path, including the experimental issues and sys- 
tematics that limit our ability to fully reconstruct events 
from the discovery data set. We explain how the miss- 
ing energy signals are simulated, and the uncertainties 
associated with these simulations. In section IHI] we dis- 
cuss the problem of populating the parts of the theory 
space relevant to a particular missing energy discovery. 
In section ITVl we introduce two groups of look-alike mod- 
els relative to two different missing energy signals. For 
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models differing only by spins, we discuss how cross sec- 
tions, kinematic distributions and efficiencies can be used 
to distinguish them, drawing from formulae developed 
in the Appendix. In section |V] we define all of the ro- 
bust observables that we use to discriminate among the 
look-alikes, and in section IVTl we describe the look-alike 
analysis itself and how we compute the significance of the 
discriminations. Sections lVIII and lVIIII give a summary of 
our results, with details relegated to further Appendices. 
Finally section HXl describes the steps we are following to 
improve this analysis for use with real data. 



II. DISCOVERY ANALYSIS FOR MISSING 
ENERGY 

Missing energy hadron collider data has been used pre- 
viously for successful measurements of Standard Model 
processes with energetic neutrinos; these include the Z'^ 
boson invisible decay rates, the top quark cross section, 
searches for the Higgs boson [3] and a precise extraction 
of the W mass from the reconstruction of the W trans- 
verse mass p^ . Pioneering searches for new phenomena 
in missing energy data sets at the Tcvatron led 
to the development and understanding of the basic tech- 
niques that will be used in missing energy searches at the 
LHC. 

In an ideal detector, with hermetic An solid angle cov- 
erage and excellent calorimeter resolution, the measure- 
ment of missing energy is the measurement of the neu- 
trino energy and the energy of any other neutral weakly 
interacting particles. In a real detector it is also a mea- 
surement of the energy that escapes detection due to 
uninstrumented regions and other detector effects such 
as imperfect calorimeter response. Muons are sources 
of missing energy since a muon typically deposits only 
of order a few GeV of its total energy in the calorime- 
ters^. QCD jets produce real E^^^'^ from semileptonic 
decays of heavy flavor, and fake E^^^^ from detector- 
induced mismeasurements. Thus the E^^^^ distribution 
of a pure QCD multijet sample has a long tail related 
to non-Gaussian tails in the detector response. This 
gives rise to an important background to missing energy 
searches that is difficult to estimate prior to data. At the 
Tevatron it has been shown that this background can be 
brought under control by exploiting the fact that the fake 
^miss fj-Qjji jgt; mismeasurements is highly correlated with 
the azimuthal directions of the leading jets [l6j . 

There are other important sources of fake E™^^^ at 
hadron colliders, including beam halo induced E™^^^, cos- 
mic ray muons, noise in the data acquisition system. 



^ The energy loss of muons is mostly due to ionization up to muon 
energies of 100 GeV. Above 100 GeV bremsstrahlung and nuclear 
losses can cause a single "catastrophic" energy loss comparable 
to the total muon energy. 



and misreconstruction of the primary vertex. Eliminat- 
ing these sources requires unbiased filters based on clean 
definitions of event quality. 

To design a missing energy analysis, we need to have 
some idea of the source of the E^^^^ in the signal. The 
possibilities include: 

• The E™^^'^ is entirely from neutrinos. This could 
arise from the direct decay of new heavy particles 
to neutrinos, or decays of new heavy particles to 
top, W^'s, Z's or r's. One appropriate discovery 
strategy for this case is to look for anomalies in the 
energetic tails of data sets with reconstructed top, 
W's or Z's. 

• The E!^^^^ originates from a single weakly interact- 
ing exotic particle in the final state. An example 
of this possibility is graviton production in models 
with large extra dimensions [20| . If strong produc- 
tion occurs, the signal will consist predominately of 
monojets and large E^^^^. Successful analyses for 
this case were carried out at the Tevatron [2l|, . 
Other signals that fit this case arise from unpar- 
ticle models [l^ and from models with s-channel 
resonances that have invisible decays. 

• The iJ™"**^ originates from many weakly interact- 
ing exotic particles. This can be the case in hidden 
valley models [ISl , where the weakly interacting ex- 
otics are light pions of the hidden sector. This case 
is experimentally challenging. 

• The E^^^^ originates from two weakly interacting 
exotic particles in the final state. This is the case 
for supersymmetry models with conserved R par- 
ity, where the weakly interacting particles are neu- 
tralino LSPs. It also applies for more generic mod- 
els with WIMP dark matter candidates. 

We focus on a discovery analysis developed for the last 
case. Thus we are interested in signal events with two 
heavy WIMPs in the final state. For early discovery 
at the LHC, the signal events should have strong pro- 
duction cross sections; we will assume that each WIMP 
arises from the decay of a strongly interacting heavy par- 
ent particle. The most generic signature is therefore large 
E^^^^ in association with at least two high Et jets. There 
will be additional jets if the WIMP is not produced in a 
2-body decay of the parent particle. Furthermore, there 
is a significant probability of an extra jet from QCD ra- 
diation, due to the large phase space. Thus it is only 
slightly less generic to design an inclusive analysis for 
large E^^^^ in association with three or more energetic 
jets. We will refer to this as the inclusive missing energy 
signature^ . 



^ The requirement of a third energetic jet greatly reduces the size 
and complexity of the Standard Model backgrounds. Thus while 
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In the basic 2 — > 2 hard scattering, the heavy par- 
ent particles of the signal will be produced back-to-back 
in the partonic subprocess center-of-mass frame; typi- 
cally they will have px roughly comparable to their mass 
nip. The WIMPs of mass mdm resulting from the parent 
particle decays will fail to deposit energy > mdm in the 
calorimeters; if the WIMPS have fairly large pr, a sig- 
nificant fraction of this energy contributes to the E™^^^. 
Thus either large rup or large mdm leads to large E™^^^. 
Note the azimuthal directions of the WIMPS are anti- 
correlated, a feature inherited from their parents, so the 
magnitude of the total E^^^^ tends to be less than the 
magnitude of the largest single contribution. 

At the LHC, the most important Standard Model 
sources of large real E^^^^ will be tt, single top, W and 
Z plus jets associated production, dibosons and heavy 
flavor decays. Most of these processes produce a hard 
lepton in association with the E^^^^ from an energetic 
neutrino. The exception is Z vv. Even with a per- 
fect detector, Z ^ vv plus jets is an irreducible physics 
background. 



A. Analysis path 

In the real data this search will be performed start- 
ing from a primary data set that includes requirements 
of missing energy, jets and general calorimetric activity 
at the trigger path; the trigger efSciency should be mea- 
sured in other data samples. 

For the offline analysis, we will adopt the inclusive 
missing energy benchmark analysis studied with the full 
detector simulation for the CMS Physics Technical De- 
sign Report [13, mi- 

The first phase is a preselection based on the event 
quality. The purpose of this primary cleanup is to discard 
events with fake E'™'®^ from sources such as beam halo, 
data acquisition noise and cosmic ray muons. To elimi- 
nate these types of backgrounds the benchmark analysis 
uses jet variables, averages them over the event to de- 
fine corresponding event variables, and uses these to dis- 
criminate real multijet events from spurious back- 
grounds. The event electromagnetic fraction (EEMF) is 
defined to be the Et weighted jet electromagnetic frac- 
tion. We define an event charged fraction (ECHF) as 
the event average of the jet charged fraction (defined as 
the ratio of the X)pt of the tracks associated with a jet 
over the total calorimetric jet Et)- The preselection also 
has a quality requirement for the reconstructed primary 
vertex. 

Events that are accepted by the preselection require- 
ments proceed through the analysis path if they have 



mature LHC analyses will explore the fully inclusive i?™"''' sig- 
nature, we assume here that an early discovery will be based on 
a multijet + E^^^"" data sample. 



missing transverse energy E^^^^> 200 GeV and at least 
three jets with Et> 30 GeV within pseudorapidity I77I < 
3. These requirements directly define the missing en- 
ergy signal signature. In addition the leading jet is re- 
quired to be within the central tracker fiducial volume 
i.e. < 1.7. Everywhere in this paper "jets" means 
uncorrected (raw) jets with Et> 30 GeV and |?7| < 3 
as measured in the calorimeters; the jet reconstruction 
is with a simple iterative cone algorithm with a 0.5 cone 
size in the rj—ip space. The missing energy is uncorrected 
for the presence of muons in the event. 

The rest of the analysis path is designed based on 
elimination of the major backgrounds. The QCD back- 
ground from mismeasured jets is reduced by rejecting 
events where the E^^^^ is too closely correlated with the 
azimuthal directions of the jets. To reduce the large back- 
ground from W{-^ £v)+iets, Z{-^ ilj+jets and tt pro- 
duction an indirect lepton veto (ILV) scheme is designed 
that uses the tracker and the calorimeters. The ILV re- 
tains a large signal efficiency while achieving a factor of 
two rejection of the remaining W and ti backgrounds. 
The veto is indirect because we don't identify leptons- 
instead events are rejected if the electromagnetic fraction 
of one of the two leading jets is too large, or if the highest 
Pt track of the event is isolated. The signals we are inter- 
ested in are characterized by highly energetic jets while 
leptons in the signal originate from cascade decays of the 
parents or semileptonic B decays in the jets; thus even 
when a signal event has leptons it is relatively unlikely 
to be rejected by the ILV. For the models in our study, 
approximately 85% of all signal events and 70% of signal 
events with muons or taus pass the ILV cut. 

The final selections require that the leading jet has 
Et > 180 GeV, and that the second jet has Et > 110 
GeV. We also require Ht > 500 GeV, where 

4 

Ht^J2^t + ^t'" , (1) 

1=2 

where the Et is summed over the second, third and 
fourth (if present) leading jets. These cuts select for 
highly energetic events, greatly favoring events with new 
heavy particles over the Standard Model backgrounds. 

Table H] summarizes the benchmark analysis path. The 
table lists the cumulative efficiencies after each selection 
for a benchmark signal model and the Standard Model 
backgrounds. The signal model is the CMS supersym- 
metry benchmark model LMl, which has a giuino with 
mass 611 GeV and squarks with masses around 560 GeV. 
The last line of the table shows the expected number of 
events that survive the selection in a data set correspond- 
ing to 1000 pb^^ of integrated luminosity. For the QCD 
background and the single top background, which are not 
shown in the table, the estimated number of remaining 
events is 107 and 3, respectively. Thus the total esti- 
mated Standard Model background after all selections is 
245 events per 1000 pb"^ 
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TABLE I: Cumulative selection efficiency after each requirement in the multijets analysis path for a low mass SUSY 

signal and the major Standard Model backgrounds (EWK refers to W jZyVW jZZjZW), see 
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B. Triggers and "boxes" 

Having established a benchmark analysis path, we also 
need to define benchmark data samples. With the real 
LHC data these will correspond to data streams and data 
paths from various triggers. For the inclusive missing 
energy signature relevant triggers are the Ef^^^^ and jet 
triggers. A single lepton trigger is also of interest, since 
many models produce energetic leptons in association 
with large E™^^^. For our study we have chosen simple 
but reasonable [1^, [2^ parametrizations of the trigger 
efficiencies defining our four benchmark triggers ^ : 

• The MET trigger is a pure inclusive E^^^^ trigger. 
It is 50% efficient for £;^"^^> 80 GeV, as seen in 
Figure [T] 

• The DiJet trigger requires two very high Et jets. It 
is 50% efficient for uncorrected jet Et> 340 GeV, 
as seen in Figure O 

• The TriJet trigger requires three high Et jets. It 
is 50% efficient for uncorrected jet Et> 210 GeV, 
as seen in Figure [31 



• The Muon20 trigger requires an energetic muon 
that is not necessarily isolated. The trigger is 88% 
efficient for muons with pr — 20 GeV/c, asymptot- 
ing to 95% as seen in Figure IH 

After applying the selection requirements, these four 
triggers define four potential discovery data sets. In our 
simulation the DiJet, TriJet, and Muon20 data sets, after 
the inclusive missing energy analysis path is applied, are 
all subsets of the MET sample, apart from one or two 
events per 1000 pb^^ ^. Thus the MET is the largest, 
most inclusive sample. We perform one complete analy- 
sis based on the MET trigger. The other three triggers 
are then treated as defining three more boxes, i.e. experi- 
mentally well-defined subsets of the MET discovery data 
set. The simplest physics observables are the counts of 
events in each box. 



C. Backgrounds and systematics 

In the CMS study the total number of Standard Model 
background events remaining after all selections is 245 
per 1000 pb~^ for an E^^^^ trigger sample. The error 
on this estimate is dominated hy i) the uncertainty in 
how well the detector simulation software simulates the 



* These are made-up triggers for the purposes of our study. The 
guidance on our parametrizations is from the published trigger 
and physics reports of the CMS experiment. We expect that the 
trigger tables of the LHC experiments will include corresponding 
trigger paths, richer and better in terms of the physics capture. 



A perfectly designed trigger table will give rise to overlaps among 
datasets from different trigger paths due to both physics and 
slow/non-sharp trigger efficiency turn-ons (resolution). 
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FIG. 1: The trigger efficiency. 



FIG. 3: The TriJet trigger efficiency. 
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FIG. 2: The DiJet trigger efficiency. 



FIG. 4: The Muon20 trigger efficiency. 



response of the actual CMS detector, and ii) the uncer- 
tainty on how weU the Standard Model event generators 
emulate QCD, top production, and W/Z plus jets pro- 
duction. Detailed studies of the real LHC data will be 
required in order to produce reliable estimates of these 
uncertainties. 

Prior to data we assign conservative error bars on these 
background projections. We have checked that 100 pb~^ 
of data in the MET trigger sample is sufhcient for a 5a 
discovery for the eight models in our study, even if we 
triple the backgrounds quoted above and include a 15% 



overall systematic error. The look-alike analysis will be 
degraded, however, in the event that the Standard Model 
backgrounds turn out to be much larger than current 
estimates. 

Prior to data, it is also difhcult to make a reliable esti- 
mate of the main systematic uncertainties that will affect 
the inclusive missing energy analysis. Systematic uncer- 
tainties will decrease over time, as the detectors are bet- 
ter understood, calibration studies are performed, and 
Standard Model physics is analyzed with the LHC data. 
For our study we have assumed that, at the moment of 
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discovery, the dominant systematic errors in the full dis- 
covery data set will come from three sources: 

• Luminosity uncertainty: it affects the counting of 
events. This systematic uncertainty is process in- 
dependent. 

• Detector simulation uncertainty: it mainly affects 
calorimetry-related variables in our study, in par- 
ticular jet counting and the missing energy. This 
systematic is partially process dependent. 

• QCD uncertainty: it includes the uncertainties 
from the parton distribution functions, higher order 
matrix elements, and large logarithms. This uncer- 
tainty affects event counting, jet counting and the 
shapes of kinematic distributions. It is partially 
process dependent. 

Note that, since we use uncorrected jets, we do not 
have a systematic from the jet energy scale. This is 
traded for a portion of the detector simulation uncer- 
tainty, i.e. how well we can map signal events into un- 
corrected jets as would be measured in the real detector. 

D. Simulation of the signals 

A realistic study of look-alikes requires full detector 
simulation. For the initial phase of this work a genera- 
tor level analysis is attractive, being computationally less 
intensive and providing a clear link between observables 
and the underlying theory models^. 

In a generator level analysis, jets are reconstructed by 
applying a standard algorithm to particles rather than 
to calorimeter towers. This obviously does not capture 
the effects of a realistic calorimeter response, calorimeter 
segmentation, and energy losses due to material in the 
tracker as well as magnetic field effects. 

A compromise between the full simulation and a gener- 
ator level analysis is a parameterized detector simulation. 
For the LHC the publicly available software packages in- 
clude AcerDET [27], and PGS [11]. In such a simulation, 
electrons, muons and photons can be reconstructed us- 
ing parameterized efficiencies and resolutions based on 
abstract but educated rules-of-thumb for modern mul- 
tipurpose detectors. Jets are reconstructed in a virtual 
calorimeter, from particle energies deposited in cells that 
roughly mimic the segmentation of a real calorimeter. 
Calorimeter response is approximated by performing a 
Gaussian smearing on these energy deposits. The E™^^^ is 
reconstructed from the smeared energies in these virtual 
towers. 



We performed a preliminary study by comparing PGS 
results to the full simulation results reported for the 
SUSY benchmark model LMl [13, [lH. We found that 
PGS jets are not a good approximation of uncorrected 
jets in the full simulation, even for the most basic prop- 
erties such as the Et spectrum. Varying the parameters 
and adding simple improvements, such as taking into ac- 
count the 4 Tesla field in the barrel, did not change this 
conclusion. PGS jets have a behavior, not surprisingly, 
that is intermediate between generator level jets and un- 
corrected full simulation jets. 

We developed a modified simulation called PGSCMS 
with the geometry and approximate magnetic field of 
the CMS detector. The PGS Gaussian smearing and 
uninstrumented effects in the calorimeters are turned off. 
Electrons, muons and photons are extracted at generator 
level, and PGS tau reconstruction is not used. Track infor- 
mation is extracted as in the standard PGS. The calorime- 
ter output improves on a generator level analysis in that 
we include approximations to the effects of segmentation 
and the 4 Tesla field, as well as an rj correction derived 
from the z value of the primary vertex. We parameterized 
the detector response in a limited set of look-up tables 
as a function of the generator-level quantities. 

At the analysis level we apply parameterized correc- 
tions and reconstruction efficiencies inspired by the pub- 
lished CMS detector performance [29|. For the jets, we 
apply an Et and 77 dependent rescaling of their Et, tuned 
to reproduce the full simulation LMl results in [icj ll]|. 
This rescaling makes the jets softer {i.e. takes into ac- 
count the detector reconstruction): a 50 GeV generator 
level jet becomes an approximately 30 GeV raw jet in our 
analysis. 

The i?™'^*^ reconstructed from PGSCMS is essentially 
identical, modulo small calorimeter segmentation effects, 
to a a generator level analysis, i.e. our E^^^^ is virtually 
indistinguishable from the Monte Carlo truth E^^^^ ob- 
tained from minus the vector sum of the Et of neutrinos, 
muons and the other weakly interacting particles (such 
as the LSP). Wc did not attempt to rescale the i?™"'*'; 
this is a complicated task since Ef^^^^ is a vector and 
in general energy losses, calorimeter response and mis- 
measurements tend to decrease the real large i?™""* tails 
while increasing the E^^^^ tails in the distribution of non- 
real E^^^^ events. Instead of attempting to rescale the 
^miss gygjji; i^y event, we raised the E^^^^ cut in our 
benchmark analysis to 220 GeV*". 

Because of the limitations of our fast simulation, we 
also simplified parts of the benchmark analysis. The first 
phase primary cleanup is dropped since it is related to 
supression of spurious processes that we do not simulate, 
and it is nearly 100% efficient for the signal. We also drop 
the jet electromagnetic fraction cuts of the ILV, because 



The full GEANT4-based simulation is too slow to adequately sam- 
ple the entire theory space. Having completed the first ex- 
ploratory phase of this work, we are repeating the analysis to 
validate these results with the full experimental simulation. 



^ In a realistic full simulation study with the first jet data in hand, 
our B™'"" analysis will avoid such compromises. 
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they are nearly 100% efficient for the signal. 

The resulting performance of our parameterized fast 
simulation for the SUSY benchmark model LMl is shown 
in Table [ni The agreement with the full simulation study 
is very good. The largest single cut discrepancy is 2%; 
this occurs for the QCD angular cuts, reflecting the ex- 
pected fact that our fast simulation does not accurately 
reproduce jet mismeasurement effects. Since the final 
efficiencies agree to within 7%, it is plausible that look- 
alikes defined in our fast simulation study will remain 
look-alikes in our upcoming full simulation study. 

It is important to note that this fast simulation does 
not reproduce the Standard Model background efficien- 
cies shown in Table HI In fact the discrepancies in the 
total efficiencies can approach an order of magnitude. 
This is to be expected. We are cutting very hard on 
the Standard Model events, thus the events that pass are 
very atypical. This is in contrast to the signal events, 
where the fraction that pass are still fairly generic, and 
their Ex and E^^^^ spectra near the cuts are less steeply 
falling than those of the background. Since SM back- 
grounds cannot be estimated from a PGS level analysis, 
we take our backgrounds from the state-of-the-art analy- 
sis in fio'l ; this approach only works because we have also 
matched the analysis path used in p^ . 

The full software chains we use in our study are sum- 
marized in Table IIIII All of the simulated data sets in- 
clude an average of 5 pileup events added to each sig- 
nal event, corresponding to low luminosity LHC running 
(- 10^3 cm-2s-i). 



III. POPULATING THE THEORY SPACE 

In Section [11] we gave a partial classification of BSM 
models according to how many new weakly interacting 
particles appear in a typical final state. Our benchmark 
E^^^^ analysis is optimized for the case of two heavy 
weakly interacting particles per event, as applies to SUSY 
models with conserved R parity, little Higgs models with 
conserved T parity and Universal Extra Dimensions mod- 
els with conserved KK parity. This study is a first at- 
tempt at constructing groups of look-alike models drawn 
from this rather large fraction of the BSM theory space, 
and developing strategies to discriminate them shortly 
after an initial discovery. 

One caveat is that models from other corners of the 
theory space may also be look-alikes of the ones consid- 
ered here. For example, models with strong production of 
heavy particles that decay to boosted top quarks can pro- 
duce higher Et jets and larger E!p^^^ from neutrinos than 
does Standard Model top production. Such look-alike 
possibilities also require study, but they are not a major 
worry since our results show that we have some ability to 
discriminate heavy WIMPS from neutrinos even in small 
data sets. 



A. SUSY 

In a large class of supersymmetry models with con- 
served R parity, not necessarily restricted to the MSSM, 
the LSP is either the lightest neutralino or a right-handed 
sneutrino^. 

In addition, if the NLSP is a neutralino or sneu- 
trino and the LSP is a gravitino, the E^^^^ signature 
is the same. Models based on gravity-mediated, gauge- 
mediated or anomaly-mediated SUSY breaking all pro- 
vide many candidate models. 

Because this relevant portion of SUSY theory space is 
already so vast, there is a temptation to reduce the scope 
of the LHC Inverse Problem by making explicit or im- 
plicit theoretical assumptions. To take an extreme, one 
could approach an early LHC discovery in the Ei^^^^ chan- 
nel having already made the assumptions that (i) the sig- 
nal is SUSY, (ii) it has a minimal Higgs sector (MSSM), 
(Hi) it has gravity-mediated SUSY breaking (SUGRA), 
(iv) the breaking is minimal (mSUGRA) and (v) 100% 
of dark matter is thermal relic LSPs with an abundance 
given by extrapolating standard cosmology back to the 
decoupling epoch. We don't want to make any such as- 
sumptions; rather we want to test theoretical hypotheses 
in the LHC discovery data set combined with other mea- 
surements. 

For SUSY we have the benefit of more than one spec- 
trum calculator that can handle general models, more 
than one matrix element calculator and event generation 
scheme, and a standardized interface via the SUSY Les 
Houches Accord (SLHA) [32]. There are still a few bugs 
in this grand edifice, but the existing functionality com- 
bined with the ability to perform multiple cross-checks 
puts us within sight of where we need to be when the 
data arrives. 



B. Little Higgs 



Little Higgs models are a promising alternative to weak 



scale supersymmetry [33[-[37|. In little Higgs models, the 



Higgs is an approximate Goldstone boson, with global 
symmetries protecting its mass (which originates from a 
quantum level breaking of these symmetries) from large 
radiative corrections. Many of these LH models require 
an approximate T parity discrete symmetry to reconcile 
LH with electroweak precision data. This symmetry is 
similar to R parity in SUSY models. The new LH par- 
ticles that would be produced at the LHC would be odd 
under this symmetry, enforcing the stability of the light- 
est particle that is odd under T parity. This new particle 
is weakly interacting and would manifest itself as missing 



* Recent analyses [3Ct l3lj have argued for the phenomenological 
viabihty of sneutrino dark matter. 
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TABLE II: Comparison of cut-by-cut selection efficiencies for our E^^'^'^ analysis applied to the SUSY benchmark model LMl. 
"Full" refers to the full simulation study [10, EH; "Fast" is what we obtain from our parameterized fast simulation. 



Out / DOitware 


ruil 


rast 


Trigger and 






-C/y > ZUU LieV 


oo.y /O 


b4.o/o 


AT 


70 1 0/ 
lA.L/o 


( l.D/o 


In^-"- 1 > 1 7 


OO. 1 /o 


yu.u /o 


QCD angular 


75.6% 


77.6% 


Jg^lead trk ^ q 


85.3% 


85.5% 


Et,i > 180 GeV, 






Et,2 > 110 GeV 


63.0% 


63.0% 


Ht > 500 GeV 


92.8% 


93.9% 


Total efficiency 


12.9% 


13.8% 



TABLE III: Summary of software chains used in this study. The little Higgs spectrum is based on [s^. PGSCMS is a variation 
of PGS v4 



Software/Models 


Group ] 


L models 


Group 2 models 


Spectrum generator 


Isajet 


V7.69 [39j or 


private little Higgs 




or SUSY 


-HIT vl.l [40] 


or SuSpect v2 . 34 f41j 


Matrix element calculator 


Pythia 


v6.4 [42] 


MadGraph v4 [43] 


Event generator 


Pythia 


v6.4 


MadEvent v4 [44] 








with BRIDGE [45] 


Showering and hadronization 


Pythia 


v6.4 


Pythia v6.4 


Detector simulation 


PGSCMS 


vl.2.5 


PGSCMS vl.2.5 




plus parameterized 


plus parameterized 




corrections 


corrections 



energy at the LHC^. 

Just as in SUSY, new colored particles are the dom- 
inant production modes. These particles subsequently 
generate high multiplicity final states through decay 
chains that end with the lightest T odd particle. In LH 
models, the strongly coupled particles are T odd quarks 
(TOQ's), analogous to the squarks of SUSY. The weakly 
coupled analogues of the gauginos are T odd spin one vec- 
tor bosons (TOV's). In the models considered to date, 
there is no analog of the gluino: this is an important con- 
sideration in constructing supersymmetric look-alikes of 
LH models. 

In this study, we work with a minimal implementation 
of a little Higgs model with T parity that is known as the 
littlest Higgs model with T parity. This model is based 
on a SU{5)/SO{5) pattern of global symmetry breaking. 
Each SM particle except the gluon has an associated LH 
partner odd under T parity. There is also an extra pair 
of top partners, one T odd and the other T even, as well 



This symmetry may be inexact, or violated by anomalies [46l |. 
Such possibilities are model dependent l47l |48| . 



as singlets. The lightest T odd particle in this model we 
label Ah- It is a heavy gauge boson that is an admixture 
of a heavy copy of the hypercharge gauge boson and a 
heavy boson. 

For event generation, we use a private implementation 
of the littlest Higgs model within MadGraph. There is a 
need to generalize this to a wider class of models. 

C. Universal extra dimensions 

Universal extra dimensions models are based on orb- 
ifolds of one or two TeV^^ size extra spatial dimen- 
sions [i^-js^. The five-dimensional version of UED is 
the simplest. At the first level of Kaluza-Klein [KK) 
excitations, each Standard Model boson has an associ- 
ated partner particle, and each Standard Model fermion 
has two associated partner particles (i.e. a vector-like 
pair). These KK partners are odd under a KK parity, 
the remnant of the broken translational invariance along 
the fifth dimension. This parity is assumed to be an ex- 
act symmetry. After taking into account mass splittings 
due to Standard Model radiative corrections, one finds 
that the lightest KK odd partner is naturally the weakly 
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interacting partner of the hypercharge gauge boson. A 
wide variety of spectra for the KK odd partners can be 
obtained by introducing additional interactions that are 
locahzed at the orbifold fixed points; these choices dis- 
tinguish generic UED from the original minimal model 
of [49|. These models resemble SUSY. 

A public event generation code based on a modifica- 
tion of Pythia is available for generic 5-dimensional UED 
models [53] . There is a need to generalize this to a wider 
class of models, e.g. 6-dimensional UED. In our study 
we have not used any UED examples, but we will include 
them in the future. 



IV. DESCRIPTION OF THE MODELS 

A. Group 1 

The five look-alike models of Group 1 are all MSSM 
models. Two of them (LM5 and LM8) are CMS SUSY 
benchmark models, while another (LM2p) is a slight vari- 
ation of a CMS benchmark. It is a sobering coincidence 
that these are look-alikes of the E^^^^^ analysis, since the 
benchmarks were developed by CMS to cover different ex- 
perimental signatures, not produce look-alikes. To round 
out Group 1 we found two other MSSM look-alikes whose 
spectra and decay chains are as different from each other 
and from the three CMS benchmarks as we could make 
them. 

The models are consistent with all current experimen- 
tal constraints, but do not all give the "correct" relic 
density of dark matter. Any comparison of relic densi- 
ties to the so-called WMAP constraints assumes at least 
three facts not yet in evidence: (i) that dark matter is a 
thermal relic, (ii) that there is only one significant species 
of dark matter and (Hi) that cosmological evolution was 
entirely radiation-dominated from the time of dark mat- 
ter decoupling until the time of Big Bang nucleosynthe- 
sis. A missing energy discovery at the LHC will help us 
test whether these assumptions have any validity. For 
example, model LM8 produces a relic density an order 
of magnitude larger than the WMAP upper bound; thus 
discriminating LM8 as a more likely explanation of an 
earl y m issing energy discovery would call into question 
[58I.T59I assumptions (i) and (iii), or could be a hint that 
the lightest neutralino is not absolutely stable. 

LM2p, LM5 and LM8 are minimal supergravity mod- 
els [s^-jill- They are specified by the usual high scale 
mSUGRA input parameters as shown in Table IIVI be- 
cause the resulting superpartner spectra depend strongly 
on RGE running from the high scale, a complete specifi- 
cation of the models also requires fixing the top quark 
mass and the particular spectrum generator program 
used. We have used mtop = 175 GeV and the ISAJET 
v7.69 generator [s^, in order to maintain compatibility 
with the CMS Physics TDR Models LM5 and LM8 
are then identical to the mSUGRA benchmark models of 
the CMS Physics TDR, while LM2p is almost identical to 



benchmark model LM2; LM2p has a slightly larger value 
of mi/2 (360 versus 350 GeV) than LM2, which makes it 
more of a look-alike of the other Group 1 models. 

TABLE IV: Input parameters for the mSUGRA models 
LM2p, LM5 and LM8. The notation comforms to ;39]. The 
mass parameters and trilinear Aq parameter have units of 
GeV. 



LM2p LM5 LM8 



mo 


185 


230 


500 


ni-1/2 


360 


360 


300 


Ao 








-300 


tan P 


35 


10 


10 


sign(^) + + + 



TABLE V: Input parameters for the MSSM models CS4d and 
CS6. The notation conforms to [iol-fiH. The mass parameters 
and trilinear A parameters have units of GoV. 





CS4d 


CS6 


Ml 


620 


400 


M2 


930 


600 


Ms 


310 


200 


-^r, -^t, -^h-i -^e, 


-400 


-300 




340 


2000 




340 


2000 




340 


340 




115600 


115600 


tan (3 


10 


10 


sign(/i) 


+ 


+ 



The Group 1 models CS4d and CS6 are not minimal 
supergravity; they are more general high scale MSSM 
models based on the compressed supersymmetry idea of 
Martin [bI, [64|. The high scale input parameters are 
shown in Table |Vl We have used mtop = 175 GeV 
and the spectrum generator combination SuSpect v2 . 34 
with SUSY-HIT vl.l [iO, |il|. Model CS4d is in fact 
part of the compressed SUSY model line defined in . 
Model CS6 is a modification of compressed SUSY where 
all of the squarks have been made very heavy, ^ 2 TeV. 

The superpartner mass spectra of the Group 1 models 
are displayed in Figure^ One notes immediately that all 
of the mSUGRA models are more similar to each other 
than they are to either of the more general MSSM mod- 
els CS4d and CS6; this shows the limitations of the usual 
SUSY analyses that do not go beyond mSUGRA. As 
their name imphes, the compressed SUSY models CS4d 
and CS6 have a compressed gaugino spectrum relative to 
mSUGRA; this produces either a light gluino (as in CS6) 
or a heavy LSP (as in CS4d). 

The relative frequency of various LHC superpartner 
production processes is summarized in Table IVI[ for the 
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1000 




FIG. 5: The mass spectra of the MSSM models LM2p, LM5, LM8, CS4d and CS6. Only the most relevant particles are shown: 
the lighter gauginos Xi, X2 and xt^ the lightest stau fi, the right-smuon and selectron denoted collectively as Ir, the lightest 
stop ti, the gluino, and the left/right up and down squarks ul, uh, and d^. The very heavy ~ 2 TeV squarks of model CS6 
lie outside the displayed range. 



Group 1 models both before and after our event selec- 
tion. The production fractions are much more similar 
after the event selection than before it; this is expected 
because the selection shapes the kinematics of the surviv- 
ing sample. Gluino pair production dominates for model 
CS6, while squark-gluino and squark-squark production 
dominate for the other four models. Pair production of 
the lightest stop is important for model CS4d before the 
selection cuts, but after the event selection very few of 
these events remain. 

Table IVIII shows the most relevant superpartner de- 
cay branching fractions. For models LM2p and LM5, 
gluino decay is predominantly to quark-f squark; for 
LM8 and CS4d it is dominantly to top and the light- 
est stop, and gluinos decay in CS6 mostly through the 



three-body mode gqxi- For models LM2p, LM5 and 
LM8, left-squarks cascade through quark-f chargino or 
quark-l-second ncutralino; right-squarks have a two-body 
decay to quark+LSP; right-squarks in model LM8 also 
have a large branching to quark-|-gluino. In model CS4d 
left-squarks decay almost entirely to quark+gluino, while 
right-squarks decay almost entirely to quark+LSP; for 
CS6 all squarks except the stop decay dominantly to 
quark-l-gluino. 

In models LM2p, LM5 and LM8 the decays of the 
lightest stop split between 6-|-chargino and top-fLSP; for 
CS4d ii decays 100% via the three-body mode hW^x\, 
while for CS6 almost all of the decays are to top-|-gluino. 

Chargino decay is dominated by decays to the lightest 
stau and a neutrino for models LM2p and CS6, and by 
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TABLE VI: Summary of LHC superpartner production for the Group 1 MSSM models LM2p, LM5, LM8, CS4d and CS6. The 
relative percentages are shown for each model, both before and after the event selection. The squark-squark percentages shown 
are excluding the contributions from pair production of the lightest stops, which are shown separately. Note that squark- 
chargino includes the production of either chargino, and squark-neutralino includes all of the four neutralinos. The category 
"other" includes weak production as well as the associated production of gluinos with charginos or neutralinos. The total NLO 
cross sections are from Prospino2 "gsl. 



LM2p LM5 LM8 CS4d CS6 



NLO cross section (pb) 


8. 




8. 


1 


12 


7 


14 


5 


12 


6 




before 


after 


before 


after 


before 


after 


before 


after 


before 


after 




cuts 


cuts 


cuts 


cuts 


cuts 


cuts 


cuts 


cuts 


cuts 


cuts 


squark-squark 


33% 


36% 


32% 


38% 


22% 


33% 


19% 


34% 


0.1% 


0.1% 


squark-gluino 


45% 


55% 


46% 


52% 


48% 


54% 


41% 


55% 


3.7% 


7.4% 


gluino-gluino 


7.2% 


6.4% 


7.4% 


6.4% 


14% 


8.3% 


11% 


8% 


95% 


92% 


stop-stop 


2.1% 


1.1% 


2.1% 


0.9% 


2.6% 


1.5% 


26% 


1.4% 






squark-chargino 


2.1% 


0.5% 


2.1% 


0.7% 


1.4% 


0.7% 


0.2% 


0.2% 






squark-neutralino 


1.7% 


0.4% 


1.8% 


0.4% 


1.2% 


0.6% 


0.6% 


0.2% 






other 


9.5% 


0.7% 


9.3% 


0.8% 


11% 


0.8% 


1.9% 


0.3% 


1.1% 


0.1% 



TABLE VII: Summary of most relevant superpartner decays 
for the MSSM models LM2p, LM5, LM8, CS4d and CS6. 







LM2p 


LM5 


LM8 


CS4d 


CS6 




9 


45% 


45% 










-> 6i6 


25% 


20% 


14% 


2% 








16% 


23% 


81% 


94% 






QQXi 






5% 




75% 


Ml 




64% 


64% 


55% 










32% 


32% 


27% 








~^ 








83% 


85% 


UR 


-MX? 
—> ug 


99% 


99% 


62% 

38% 


92% 


85% 


bi 




42% 


36% 


35% 


20% 


9% 




-6X^ 


29% 


23% 


22% 


14% 


5% 




-fex? 


7% 


2% 


1% 


50% 






-^bg 










85% 


ii 


-^bxt 


45% 


43% 


42% 








^tx^l 


22% 


25% 


30% 




4% 




^tg 










96% 




bW+Xi 








100% 






w^xl 


5% 
95% 


97% 


100% 


100% 


2% 
77% 


X^ 




1% 


11% 


100% 


100% 






^hx\ 


3% 


85% 






2% 




flT 


96% 


3% 






77% 


fl 


-^rxl 


100% 


100% 


88% 


98% 


100% 



decays to W+LSF for models LM5, LM8 and CS4d. The 
second neutralino X2 decays almost entirely to r+stau 
for models LM2p and CS6, and goes 100% to Z+LSP in 
models LM8 and CS4d. The LM5 model has the distinct 



feature that 85% of X2 decays are to Higgs+LSP. 

Table IVIIII shows the most significant inclusive final 
states for the Group 1 models. By final state we mean 
that all unstable superpartners have decayed, while Stan- 
dard Model particles are left undecayed. We use q to de- 
note any first or second generation quark or antiquark, 
but list bottom and top quarks separately. The percent- 
age frequency of each final state is with respect to the 
events passing our selection. The final states are inclu- 
sive, thus e.g. the events in the qqq XiXi final state are 
a subset of those in the qq XiXi final state, and the total 
percentages in each column exceed 100% . By the same 
token, most exclusive final states actually have more par- 
tons than are listed for the corresponding inclusive entries 
in Table IVIIII so even at leading order parton level they 
produce more jets. 

For models LM2p, LM5 and CS6, the dominant inclu- 
sive final state is ggXiXi + i-6- multijets plus missing 
energy from the two LSPs. This is the motivation behind 
the design of our analysis. For model CS4d, the most 
likely production is squark-gluino followed by squark de- 
cay to quark-l-LSP; the gluino then decays to top-|-stop, 
with the stop decaying via the three-body mode feVF+Xi- 
The most popular exclusive final state is thus btWqXiXi- 
Similarly, for LM8 the most popular exclusive final states 
are btWqxiXi 3'iid ttQXiXi? from squark-gluino produc- 
tion followed by gluino decay to top-f stop. 

Final states with W^s are prevalent in models LM5, 
LM8 and CS4d. The LM2p model stands out because 
of the high probability of taus in the final state. Model 
LM5 produces a significant number of light Higgses from 
superpartner decays. Model LM8 has a large fraction of 
events with Z bosons in the final state. Model CS4d is en- 
riched in final states with multiple tops and W's, of which 
one representative example is shown: bbttWW Xi Xi- 

Summarizing this discussion, we list the most signifi- 
cant features of each model in Group 1: 
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TABLE VIII: Summary of significant inclusive partonic fi- 
nal states for the Group 1 MSSM models LM2p, LM5, LM8, 
CS4d and CS6. By final state we mean that all unstable 
superpartners have decayed, while Standard Model particles 
are left undecayed. Here q denotes any first or second genera- 
tion quark or antiquark, and more generally the notation does 
not distinguish particles from antiparticles. The percentage 
frequency of each final state is with respect to the events pass- 
ing our selection. The final states are inclusive, thus e.g. the 
events in the qqq XiXi final state are a subset of those in the 
qq XiXi final state, and the total percentages in each column 
exceed 100% . 



LM2p LM5 LM8 CS4d CS6 


WX? X? 


57% 


61% 


34% 


38% 


98% 


qqq xS x? 


20% 


19% 


3% 


4% 


79% 


qqqq xl x!i 


1% 


1% 


1% 


1% 


77% 


tVt qxl X? 


39% 


1% 






1% 


TT q xi xi 


25% 


1% 






1% 


hqxl Xl 


30% 


25% 


33% 


69% 


19% 


btWqxl xi 


10% 


19% 


31% 


67% 




Wqxixl 


25% 


52% 


56% 


93% 




hqxl Xl 


3% 


20% 








ttqxl xl 


9% 


4% 


40% 


11% 


2% 


Zqxl xl 


10% 


8% 


35% 


11% 




Z Wqxixl 


2% 


6% 


23% 


6% 




bb tt WW xl xS 






2% 


18% 





TABLE IX: Estimated number of events passing our selection 
per 100 pb~^ of integrated luminosity, for the Group 1 models 
LM2p, LM5, LM8, CS4d and CS6. These estimates use NLO 
cross sections and the CTEQ5L pdfs. 

LM2p LM5 LM8 CS4d CS6 
211 200 195 195 212 



Model LM2p: 800 GeV squarks are slightly lighter 
than the gluino, and there is a 155 GeV stau. Dom- 
inant production is squark-gluino and squark-squark. 
Left-squarks decay about two-thirds of the time to 
quark-l-chargino, and one-third to quark-|-LSP; right- 
squarks decay to quark-|-LSP. Gluino decay is mostly to 
quark-t-squark. Charginos decay to the light stau plus a 
neutrino, while the second neutralino decays to T-|-stau. 
Two-thirds of the final states after event selection have 
at least one r. 

Model LM5: 800 GeV squarks are shghtly fighter 
than the gluino. Dominant production is squark-gluino 
and squark pairs. Left-squark decays about two-thirds 
to quark-f chargino, and one-third to quark+LSP; right- 
squarks decay to quark-|-LSP. Gluino decay is mostly to 
quark-|-squark. Charginos decay to a, W and an LSP, 
while the second neutralino decays to a light Higgs and 
an LSP. After selection more than half of final states have 
a W boson, and a fifth have a Higgs. 



Model LM8: The 745 GeV gluino is shghtly fighter 
than all of the squarks except bi and ti. Dominant pro- 
duction is squark-gluino and squark pairs. Left-squarks 
decay about two-thirds to quark-f chargino, and one- 
third to quark+LSP; right-squarks decay two-thirds to 
quark-|-LSP and one-third to quark+gluino. Gluino de- 
cay is dominantly to top and a stop; the 548 GeV stops 
decay mostly to 6-t-chargino or top-t-LSP. Charginos de- 
cay to ly-l-LSP, and the second neutralino decays to 
Z+LSP. After selection 40% of final states have two tops, 
which may or may not have the same sign. More than 
half of the final states have a W, more than a third have 
a Z , and a quarter have both a W and a Z . 

Model CS4d: The 753 GeV gluino is in between the 
right-squark and left-squark masses. The LSP is rela- 
tively heavy, 251 GeV, and the ratio of the gluino to LSP 
mass is small compared to mSUGRA models. Dominant 
production is squark-gluino and squark-squark. Left- 
squarks decay to quark-l-gluino, and right-squarks decay 
to quark-|-LSP. Gluinos decay to top and a stop; the 352 
GeV stops decay 100% to bW+Xi- Two-thirds of the fi- 
nal states contain btWqxIxi: a significant fraction 
of these contain more b's, t's and W^s. 

Model CS6: The 589 GeV gluino is much fighter than 
the 2 TeV squarks, and the ratio of the gluino to LSP 
mass is small compared to mSUGRA models. Production 
is 92% gluino-gluino, and gluinos decay predominantly 
via the three-body mode qqxi- The final states consist 
almost entirely of three or four quarks plus two LSPs, 
with a proportionate amount of the final state quarks 
being 6's. 

TABLE X: Parameter choices defining the little Higgs model 
LH2. We choose our conventions to agree with those found 
in [3^: / is the symmetry-breaking scale, is the T-odd 
quark Yukawa coupling, k] is the T-odd lepton Yukawa cou- 
pling and sin a is a mixing angle. CKM mixing has been 
suppressed for our analysis. 



/ 


700 GeV 




0.55 




2.0 


sin a 


0.17 



B. Group 2 

Group 2 consists of three look-alike models: LH2, NM4 
and CS7, and a comparison model NM6. LH2 is a lit- 
tlest Higgs model with conserved T parity. The param- 
eter choices defining this model are shown in Table [X] 
The mass spectrum of the lighter partners is shown in 
Figure [6l not shown are the heavier top partners T_|_ , T_ 
with tuned masses 3083 and 3169 GeV respectively, the 
charged lepton partners i'j^, £^ with mass 2522 GeV 
and the neutrino partners vjj, vf^, vfj with mass 2546 
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FIG. 6: The mass spectra of the models LH2, NM6, NM4 and CS7. Only the most relevant partners are shown: the lighter 
gauginos Xi) X2 and Xi , the lightest stau fi, the right-smuon and selectron denoted collectively as la, the gluino, and the 
left/right up and down squarks ul, uji, and d^j. For the little Higgs model LH2, the relevant quark and vector partners are 
shown: the gauge boson partners Ah, Zh, Wh, and the three generations of quark partners u^, d^u, * = !> 2, 3. 



GeV. Model LH2 is consistent with all current experi- 
mental constraints [H, [111 . 

NM6, NM4 and CS7 are all MSSM models. The 
high scale input parameters are listed in Table IXII We 
have used mtop = 175 GeV and the spectrum generator 
SuSpect v2.34. The mass spectra are shown in Figure IHl 
not shown are the heavy gluinos of NM6 and NM4 with 
masses 2000 and 1536 GeV respectively, and the 2 TeV 
squarks of model CS7. 

The SUSY model NM6 was chosen to have a spec- 
trum identical to that of the little Higgs model LH2, 
apart from the heavy gluino that has no counterpart in 
LH2. Thus to a good approximation these two models 
differ only by the spins of the partners. While LH2 and 



NM6 are in this sense twins, they are not look-alikes of 
our benchmark inclusive missing energy analysis. Models 
NM4 and CS7, by contrast, are SUSY look-alikes of the 
little Higgs model LH2. The superpartner spectrum of 
NM4 is roughly similar to the partner spectrum of LH2, 
but the superpartners are lighter. The spectrum of CS7 
has no similarity to that of LH2. 

The relative frequency of various LHC little Higgs part- 
ner production processes are shown in Table IXIIl for the 
LH2 model both before and after our event selection. For 
LH2 the predominant process is gg or qq partons initiat- 
ing QCD production of a heavy partner quark-antiquark 
pair; this process is completely equivalent to tt produc- 
tion at the LHC. The most striking feature of Table IXlIl 
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TABLE XI; Input parameters for the MSSM models NM6, 
NM4 and CS7. The notation conforms to [13, IH. The mass 
parameters and trilinear A parameters have units of GeV. 





NM6 


NM4 


CS7 


Ml 


138 


105 


428 


Ma 


735 


466 


642 


M3 


2082 


1600 


214 


At 





-50 


-321 


At 


4000 





-321 










-321 




755 


590 


2000 




760 


580 


2000 




770 


590 


2000 


M„^,M6^ 


770 


580 


2000 




765 


580 


2000 




2500 


540 


340 




115600 


115600 


115600 


tan f3 


10 


10 


10 


sign(M) 


+ 


+ 


+ 



is that nearly half of the total production involves weak 
interactions. For example the second largest production 
mechanism, 14% of the total, has two valence quarks in 
the initial state producing a pair of first generation heavy 
partner quarks; at tree- level this is from s-channel anni- 
hilation into a W and t-channel exchange of a Zh or Ah 
partner. 

The superpartner production at the LHC for the SUSY 
models NM6, NM4 and CS7 is summarized in Table lXIIll 
For NM6 and NM4 a major contribution is from gg 
or qq partons initiating QCD production of a squark- 
antisquark pair. Production of a first generation squark 
pair from two initial state valence quarks is also impor- 
tant; in contrast to the LH2 non-SUSY analog this is 
a QCD process with t-channel exchange of the heavy 
gluino. For model CS7, which has a light gluino and very 
heavy squarks, 96% of the production is gluino pairs. 

The primary decay modes for the lighter LH2 partners 
are shown in Table IXTVl while those for the SUSY mod- 
els are summarized in Table |XVl Tables IXVll and IXVlIl 
display the most significant inclusive partonic final states 
for the Group 2 models. 

For LH2, a large fraction of heavy partner quarks have 
a direct 2-body decay to a quark and an Ah WIMP. The 
other heavy partner quark decay mode is a two stage 
cascade decay via the Wh and Zh partner bosons. Since 
the Wh decays 100% to WAh while the Zh decays 100% 
to HAh, a large fraction of events have a or a Higgs 
in the final state. 

Analogous statements apply to the SUSY models NM6 
and NM4. We see that 100% of right-squarks and a sig- 
nificant fraction of left-squarks undergo a direct 2-body 
decay to quark+LSP. The rest have mostly a two stage 



cascade via the lightest chargino Xi or the second neu- 
trafino x"- Since xf decays 100% to M^+LSP, while the 
X2 decays dominantly to a Higgs-|-LSP, a significant frac- 
tion of events have a or a Higgs in the final state. 

For the remaining SUSY model CS7, gluino pair pro- 
duction is followed by 3-body decays of each gluino to 
a quark-antiquark pair -|- LSP. As can be seen in Ta- 
ble IXVIII this leads to high jet multiplicity but nothing 
else of note besides a proportionate number of bb and tt 
pairs. 

TABLE XIL Production channels for little Higgs partners in 
the LII2 model, both before and after the event selection. 
Here Q stands for any of the quark partners u\j, d^, i = 
1, 2, 3. The total LO cross section as reported by MadEvent is 
6.5 pb. 



before cuts after cuts 


QiQ^ 


55% 


64% 


Uffd'H, UhU'h, d^Hd-H 


14% 


16% 




12% 


7% 


u}jZh, u'jjAh, d)jZH, d)jAH 


9% 


5% 


QiQj, i / j 


3% 


3% 


other 


7% 


5% 



C. Comparison of models differing only by spin 

We have already noted that SUSY model NM6 has 
a superpartner spectrum almost identical to the heavy 
partner spectrum of the non-SUSY little Higgs model 
LH2. The only relevant difference, other than the spins 
of the partners, is that model NM6 has a very heavy 2 
TeV gluino that has no analog in LH2. Despite being very 
heavy, the gluino does make a significant contribution to 
squark-squark production via t-channel exchange. 

This pair of models provides the opportunity for a com- 
parison of realistic models that within a good approxima- 
tion differ only by the spins of the partner particles. If 
it turned out that these two models were look-alikes in 
our benchmark inclusive missing energy analysis, then 
discriminating them would be physically equivalent to 
determining the spins of at least some of the heavy part- 
ners. 

It is an ancient observation (see e.g. [6^) that models 
differing only by the spins of the new heavy exotics have 
significant differences in total cross section. The most 
familiar example is the comparison of pair production 
of heavy leptons near threshold with pair production of 
spinless sleptons. For mass m and total energy y/s, the 
lepton cross section is proportional to /?, 

(2) 

while the slepton cross section is proportional to 0^ . 
Thus slepton production is suppressed near threshold 
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TABLE XIII: Summary of LHC superpartner production for the Group 2 MSSM models NM6, NM4 and CS7. The relative 
percentages are shown for each model, both before and after the event selection. Here (ji denotes any of the three generations 
of left and right squarks. Note that squark-chargino includes the production of either chargino, and squark-neutralino includes 
all of the four neutralinos. The LO total cross sections are as reported by MadEvent. 





NM6 


NM4 


CS7 


LO cross section (pb) 


2.3 


10.3 


5.0 


before after before after before after 
cuts cuts cuts cuts cuts cuts 


(jiQi 


31% 29% 


34% 26% 




ud, uu, dd 


32% 28% 


29 23% 




squark-gluino 


3% 10% 


5% 23% 


4% 8% 


gluino-gluino 






96% 91% 


squark-chargino 


2% 2% 


3% 1% 




squark-neutralino 


4% 1% 


4% 




i + j 


15% 17% 


17% 14% 




other 


13% 13% 


8% 13% 





TABLE XIV: Decay modes for the lighter little Higgs partners 
of model LH2. 



d'^, i = l,2 




u'Wh 


52% 






d'Zn 


26% 






d'An 


22% 


d% 




hZn 


54% 






bAa 


46% 


u)j, i = 1,2 




d'WH 


31% 






u'Zh 


15% 






u'Ah 


54% 




> 


bWn 


41% 






tAn 


59% 


Wh 




WAh 


100% 


Zh 




hAn 


100% 



compared to production of heavy leptons with the same 
mass. A somevifhat less familiar fact is that the sleptons 
never catch up: even if we introduce both left and right 
sleptons, to match the degrees of freedom of a Dirac lep- 
ton, the total cross section for left+right slepton pairs 
is one-half that of Dirac lepton pairs in the high energy 
limit P ->l. 

For the hadroproduction relevant to our models LH2 
and NM6, the discussion is more complicated: the most 
relevant details and references are presented in our Ap- 
pendix. From Tables IXIII and IXIII} we see a large dif- 
ference in the leading order total LHC cross sections for 
these models: 6.5 pb for LH2 versus only 2.3 pb for NM6. 
Thus the non-SUSY twin has almost a factor of three 
cross section enhancement, in spite of the fact that the 
SUSY model benefits from some extra production medi- 
ated by gluinos. 

The possibility of distinguishing SUSY from non-SUSY 



TABLE XV: Summary of most relevant superpartner decays 
for the MSSM models NM6, NM4 and CS7. 





NM6 


NM4 


CS7 


9 


66% 


67% 




^ bb 


17% 


17% 




iit 


17% 


16% 




qqxi 






99% 


UL dxt 


39% 


59% 


12% 




44% 


12% 




UR -> ux° 


100% 


100% 


4% 


bi 6X2 


24% 


14% 


6% 


- fex'i' 


70% 


86% 




ii ^ bxt 


40% 


27% 




^txl 


60% 


73% 


5% 


xt ^ W^xi 


100% 


100% 


1% 








35% 


^ vi 






28% 


— > VT 






17% 


x^ - ^X? 


22% 


19% 






78% 


81% 




TT 






39% 








45% 


ii 






16% 



twins at the LHC using total cross section was first sug- 
gested by Datta, Kane and Toharia [ij, and studied in 
more detail in [68| . To implement this idea, we must also 
compare the relative efficiencies of the SUSY and non- 
SUSY twins in a real analysis, since what is measured in 
an experiment is not total cross section but rather cross 
section times efficiency. 

An important observation is that the pT distributions. 
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TABLE XVI: Significant inclusive partonic final states for the 
little Higgs model LH2. The percentage frequency of each 
final state is with respect to the events passing our selection. 



qq Ah Ah 


64% 


W qq Ah Ah 


39% 


h qq Ah Ah 


22% 


bb Ah Ah 


14% 


WW qq Ah Ah 


8% 


hh bb Ah Ah 


4% 


hh qq Ah Ah 


3% 


tt Ah Ah 


3% 



TABLE XVII: Significant inclusive partonic final states for 
the Group 2 MSSM models NM6, NM4 and CS7. The per- 
centage frequency of each final state is with respect to the 
events passing our selection. 





NM6 NM4 


CS7 




84% 


83% 


100% 


xi Xi 


8% 


16% 


100% 


mm xi Xi 






95% 


bb q X? Xi 


2% 


5% 


11% 


W qq X? X? 


26% 


35% 




hqxl X? 


14% 


19% 




ttgx? X? 


1% 


1% 


11% 


Zqxl X? 


4% 


5% 




WW q xS X? 


4% 


9% 





in addition to the total cross sections, have large dif- 
ferences due solely to differences in spin. As an exam- 
ple, consider the LHC production of a pair of 500 GeV 
heavy quarks, versus the production of a pair of 500 GeV 
squarks. We can compare the pt distributions by com- 
puting 

dpT o dpT 

wheie we factor out the difference in the total cross sec- 
tions. Using the analytic formulae reviewed in the Ap- 
pendix, we have computed ([3]) for two relevant partonic 
subprocesses. The first is gluon-gluon initiated produc- 
tion, for which the fully differential cross sections are 
given in (|C33|) and (|C34[) : at leading order this arises 



TABLE XVIII: Estimated number of events passing our se- 
lection per 100 pb~^ of integrated luminosity, for the Group 
2 models LH2, NM6, NM4 and CS7. These estimates use 
leading order cross sections and the CTEQ5L pdfs. 

LH2 NM6 NM4 CS7 
94 43 97 91 



from an s-channel annihilation diagram, a gluon seag- 
ull for the squark case, and t and u channel exchanges 
of either the spin 1/2 heavy quark or the spin squark. 
The second example is quark-antiquark initiated produc- 
tion, in the simplest case where the quark flavor does not 
match the quark/squark partner flavor; at leading order 
there is only one diagram: s-channel annihilation. The 
fully differential cross sections are given in (jC19p and 
(IM . 

For this simple example, we have integrated the fully 
differential cross sections over the parton fluxes, using 
the CTEQ5L parton distribution functions. The resulting 
normalized px distributions are shown in Figures [7] and 
[51 For the gg initiated production the SUSY case has a 
significantly softer px distribution, while for the qq initi- 
ated production the SUSY case has a significantly harder 
Pt distribution. 

We see similar differences in the complete models LH2 
and NM6. Figure [9] shows a comparison of the px dis- 
tributions for heavy quark partner production from LH2 
and squark production for NM6. All of the leading order 
partonic subprocesses are combined in the plot, and no 
event selection has been performed. The pr distribution 
for the SUSY model is harder than for the non-SUSY 
model. Part of this net effect is due to intrinsic spin dif- 
ferences e.g. as depicted in Figure [H and part is due 
to SUSY diagrams with virtual gluino exchange. One 
would expect SUSY events to have a higher efficiency to 
pass our missing energy selection than non-SUSY events. 
Indeed this is the case: 19% of NM6 events overall pass 
the selection, whereas only 14% of LH2 events do. The 
higher efficiency of SUSY NM6 events in passing the se- 
lection somewhat compensates for the smaller total cross 
section compared to the non-SUSY LH2. 

The event counts can be obtained by multiplying each 
total cross section times the total efficiency times the 
integrated luminosity. For a 100 pb^^ sample, the total 
signal count is 94 events for model LH2 and 43 events 
for model NM6. The net result is that although LH2 
and NM6 are twins in the sense of their spectra, they are 
not missing energy look-alikes in our benchmark analysis. 
Thus the good news is that, for models that differ only 
(or almost only) by spin, the event count in the discovery 
data sample is already good enough to discriminate them. 
This is one of the important conclusions of our study. 

However this is also something of an academic exercise, 
since in the real experiment we will need to discriminate 
a large class of SUSY models from a large class of non- 
SUSY models. In this comparison a SUSY model can 
be a look-alike of a non-SUSY model even though the 
spectra of partner particles don't match. This is what 
happens with SUSY models NM4 and CS7, which are 
both look-ahkes of LII2. Model NM4 looks particularly 
challenging, since its superpartner spectrum is basically 
just a lighter version of NM6. Compared to NM6, the 
total cross section of NM4 is more than 4 times larger 
(10.3 pb) while the efficiency to pass our missing energy 
selection is only half as good (9%). This gives a total 
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count of 97 events for 100 pb ^, making NM4 a look- 
alike of LH2. 



> 




FIG. 7: Comparison of the normalized pr distributions for 
leading order gg initiated production of a pair of 500 GeV 
particles. The solid (red) line corresponds to quark-antiquark 
pair; the dot-dashed (blue) line to a squark-antisquark pair. 
The distributions have been integrated over the parton fluxes 
using the CTEQ5L pdfs. 



V. OBSERVABLES 

Having in mind an early discovery at the LHC, e.g. 
in the first 100 pb^^ of understood data, we have made 
conservative assumptions about the physics objects that 
will be sufficiently well-understood for use in our look- 
alike analysis of a missing energy discovery. 

We assume that we can reconstruct and count high 
Et jets and hard tracks, as is required for our bench- 
mark missing energy selection. We do not assume that 
validated jet corrections for multijet topologies will be 
available. We assume it will be possible to use the uncor- 
rected (raw) i?™"''' (without subtracting the momentum of 
muons or correcting for other calorimetric effects). 

We assume the ability to reconstruct and count high 
Pt muons; a study of Z ^ M"""/^" events is a nec- 
essary precursor to understanding the Standard Model 
E!p^^^ backgrounds. It will also be possible to count high 
Et electrons, however we are not yet including electrons 
in our study because of the high "fake" rate expected at 
start-up. Multiflavor multilepton signatures are of great 
importance as model discriminators, though challenging 
with small data sets; this is worthy of a separate dedi- 
cated study [HItO]. 

In our study instead of applying sophisticated b and 
T tagging algorithms we isolate enriched samples of b 
quarks and hadronic r's, by defining simple variables sim- 
ilar to the typical components of the complete tagging al- 




"o 100 200 300 400 500 600 700 800 900 1000 

p,^ (GeV/c) 



FIG. 8: Comparison of the normalized pr distributions for 
leading order gq initiated production of a pair of 500 GeV par- 
ticles, for the case that the initial parton flavor does not match 
the final parton flavor. The solid (red) line corresponds to 
quark-antiquark pair; the dot-dashed (blue) line to a squark- 
antisquark pair. The distributions have been integrated over 
the parton fluxes using the CTEQ5L pdfs. 



gorithms: leptons in jets, track counting, and impact pa- 
rameter of charged tracks, to mention a few. This "poor 
man's" tagging is not sufficient to obtain pure samples 
of 6's or r's, but allows the discrimination of look-alike 
models with large differences in the 6 or r multiplicity. 



A. Inclusive counts and ratios 

We build our look-alike analysis strategy using simple 
ingredients. We start with the four trigger boxes defined 
in section [TTbI MET, DiJet, TriJet and Muon20. For 
the simulated data samples corresponding to each box, 
we compute the following inclusive counts of jets and 
muons: 

• N, the number of events in a given box after our 
benchmark selection. 

• N(nj), the number of events with at least n jets 
(n=3,4,5). Note that N(3j) = N because of our 
selection. 

• N(m/^-nj), the number of events with at least n jets 
and m muons (n=3,4 and m=l,2). 

• N(ss/i), the number of events with at least two 
same-sign muons. 

• N(os/i), the number of events with at least two 
opposite-sign muons. 
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FIG. 9: Comparison of the pr distributions for heavy quarks 
from the httle Higgs model LH2 and squarks from the "twin" 
SUSY model NM6. The solid (red) line corresponds to heavy 
quark partners from model LH2; the dashed (blue) line to 
squarks from model NM6. For both models 100,000 events 
were generated using MadGraph and the CTEQ5L pdfs; no se- 
lection was applied. 

In these counts a muon implies a reconstructed muon 
with pt > 20 GeV/c and \ri\ < 2.4 with no isolation 
requirement. 

From these inclusive counts we can define various in- 
teresting ratios. All of the ratios are of correlated ob- 
servables. Examples of ratios are: 

• r(nj)(3j)=N(nj)/N(3j), with n=4,5, a measure of 
jet multiplicity. 

• r(2^-nj)(l//-nj)=N(2/i-nj)/N(l^-nj), with n=3,4, a 
measure of muon multiplicity. 

In appropriately chosen ratios of inclusive counts, im- 
portant systematic effects cancel partially or completely. 
For example, if the detector simulation does not precisely 
reproduce the Et spectrum of signal jets as seen in the 
real detector, this introduces a systematic error in the 
N(nj) counts, since jets are only counted if they have 
Et> 30 GeV. However we expect partial cancellation of 
this in the ratio of inclusive counts r(4j)(3j). Another 
large systematic in the jet counts, the pdf uncertainty, 
also cancels partially in the ratios. The luminosity un- 
certainty cancels completely in the jet ratios. As we dis- 
cuss below, ratios of correlated observables are also less 
sensitive to statistical fluctuations. 

In order to enhance the robustness and realism of our 
study, we have cast all of our physical observables into 
the form of inclusive counts and ratios thereof, and our 
look-alike analysis only uses the ratios. This has the 
added advantage of allowing us to compare different dis- 
criminating variables on a more even footing. In the next 



five subsections we explain how this casting into counts 
and ratios is done. 



B. Kinematic observables 

As noted in Section [TTl the distribution of the miss- 
ing transverse energy in the signal events is related to 
TOdm, the mass of the WIMP, as well as to nip, the mass 
of the parent particles produced in the original 2 — > 2 
partonic subprocess. In the benchmark E^^^^ selection, 
we used the kinematic variable Ht, as well as the Et of 
the leading and second leading jets. The distributions of 
these kinematic variables are also related to the underly- 
ing mass spectrum of the heavy partners. 

We have employed two other kinematic variables in our 
study. The first is M, the total invariant mass of all of the 
reconstructed jets and muons in the event. The second 
is Moff, the scalar sum of the i?™"'*' with the Et of all 
the jets in the event. 

Figures [TUlfT51 show a comparison of the A/oft, Ht, M 
and E^^^^ distributions, after selection, for models LM2p, 
CS4d and CS6. These models, though look-ahkes of our 
E^^^^ analysis, have a large spread in their superpartner 
spectra, as is evident from Figure [5] For sufficiently large 
data samples, this leads to kinematic differences that are 
apparent, as seen in Figures fTUlfTSl 

All of the distributions exhibit broad peaks and long 
tails. In principle one could use the shapes of these distri- 
butions as a discrimination handle. This would require a 
deep understanding of the detector or a very conservative 
systematic error related to the knowledge of the shapes. 
The location of the peaks is correlated with the mass rrip 
of the parent particles in the events, but there is no prac- 
tical mapping from one to the other. By the same token, 
this implies that these kinematic distributions are highly 
correlated, and it is not at all clear how to combine the 
information from these plots. 

Our approach to kinematic observables in small data 
sets (low luminosity) is to define inclusive counts based on 
large bins. The dependence on the details of the detector 
simulation is strongly reduced by limiting the number of 
bins and using a bin width much larger than the expected 
detector resolution. 

For Moff we define two bins and one new inclusive 
count for the kinematic distributions in each box: 

• N(Meffl400) the number of events after selection 
with Afeff > 1400 GeV/c2. 

For Ht we also define two bins and one new inclusive 
count: 

• N(IIT900) the number of events after selection with 
Ht > 900 GeV/c2. 

Recall that the selection already required Ht > 

500 GeV. For the invariant mass M we define three bins 
and two new inclusive counts: 
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FIG. 10: Comparison of the Mch distributions for Group 
1 MSSM models LM2p (solid red line), CS4d (dashed blue 
line) and CS6 (dotted magenta line). For each model 100,000 
events were generated then rescaled to 1000 pb~^. 



FIG. 12: Comparison of the distributions of the total invari- 
ant mass of jets and muons per event for Group 1 MSSM 
models LM2p (solid red line), CS4d (dashed blue line) and 
CS6 (dotted magenta line). For each model 100,000 events 
were generated then rescaled to 1000 pb~^. 
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FIG. 11: Comparison of the Ht distributions for Group 1 
MSSM models LM2p (solid red hue), CS4d (dashed blue 
line) and CS6 (dotted magenta hne). For each model 100,000 
events were generated then rescaled to 1000 pb~^. 



• N(M1400) the number of events after selection with 
M > 1400 GeV/c2; 

• N(M1800) the number of events after selection with 
M > 1800 GeV/c2; 

For E™^^'^ we define four bins and three new inclusive 
counts: 

• N(MET320), the number of events after selection 
having £;f?'^"> 320 GeV. 
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FIG. 13: Comparison of the Et'"" distributions for Group 
1 MSSM models LM2p (solid red line), CS4d (dashed blue 
line) and CS6 (dotted magenta line). For each model 100,000 
events were generated then rescaled to 1000 pb~^. 



• N(MET420), the number of events after selection 
having £;^'^"> 420 GeV. 

• N(MET520), the number of events after selection 
having Ef'^''> 520 GeV. 

Note that the Eip^^'^ selection already required E^^^^> 
220 GeV. 
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C. Kinematic peaks and edges 

With large signal samples, kinematic edges involving 
leptons will be a powerful tool for model discrimination 
and to eventually extract the mass spectrum of the heavy 
partners. With small samples, in the range of 100 pb^^ to 
1000 pb~^ considered in our study, this will only be true 
in favorable cases. In fact for the 8 models studied here, 
we find no discrimination at all based on kinematic edges 
with leptons. This is due mostly to the small number of 
high pt muons in our signal samples , as well as a lack 
of especially favorable decay chains. 

Although we do not observe any dimuon edges, we do 
see a dimuon peak for model LM8. This is shown in 
Figure p^ . for signal events after our missing energy 
selection rescaled to 1000 pb~^ of integrated luminosity. 
A Z peak is clearly visible, arising from squark decays to 
quark +X2, with the X2 decaying 100% to Z+ LSP. 
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FIG. 14: The invariant mass distribution for opposite sign 
dimuon pairs passing the missing energy selection, from 
MSSM model LM8. The plot is from 100,000 generated events 
rescaled to 1000 pb~^. 



D. Event shapes, hemispheres and cones 

Event shapes are a way to extract information about 
the underlying partonic subprocess and the resulting de- 
cay chains. This information is not uncorrelated with 



Low Pt lepton and dilepton trigger paths as well as cross-triggers 
combining leptons with jets and leptons with missing energy re- 
quirements are needed for Standard Model background calibra- 
tion and understanding; these will be important for signal ap- 
pearance and edge/threshold studies even at start-up. The LHC 
experiments are preparing rich trigger tables along these lines 



kinematics, but it does have the potential to provide qual- 
itatively new characteristics and properties of the event 
topology. For our signal the partonic subprocess con- 
sists of the production of two heavy partner particles, so 
each event has a natural separation into two halves that 
we will call "hemispheres" , consisting of all of the decay 
products of each partner. Associated to each hemisphere 
of the event should be one WIMP plus some number of 
reconstructed jets and muons. A perfect hemisphere sep- 
aration is thus an associative map of each reconstructed 
object into one of the two constituent decay chains. 

We can define two hemisphere axes in rj-cj) as the di- 
rections of the original parent particles; these axes are 
not back-to-back because of the longitudinal (and trans- 
verse) boosts from the subprocess center-of-mass frame 
back to the lab frame. Even if we knew event by event 
how to boost to the center-of-mass frame, the hemisphere 
separation would still not be purely geometrical, since in 
some events a decay product of one parent will end up in 
the geometrical hemisphere of the other parent. 

Having defined a perfect hemisphere separation, we 
need a practical algorithm to define reconstructed hemi- 
spheres. We will follow a study of 6 hemisphere algo- 
rithms presented in the CMS Physics TDR [101 . These 
algorithms are geometrical and kinematic, based on the 
large though imperfect correlation between the rj-cj) vec- 
tor of the initial parent and the reconstructed objects 
from this parent's decay chain. The algorithms consist 
of two steps: a seeding method that estimates the two 
hemisphere axes, and an association method that uses 
these two axes to associate each reconstructed object to 
one hemisphere. 

For the Group 1 model LM5, the CMS study found 
that jets were correctly associated to parent squarks 87% 
of the time, and to parent gluinos 70% of the time. The 
differences in the performance of the 6 algorithms were 
small; the best-performing hemisphere algorithm com- 
bines the following methods: 

• Seeding method: The two hemisphere axes are cho- 
sen as the rj-cf) directions of the pair of reconstructed 
objects with the largest invariant mass. The hard- 
est of these objects defines the leading hemisphere. 

• Association method: A reconstructed object is as- 
signed to the hemisphere that minimizes the Lund 
distance [Toj . 

Figure [15] shows how well the seeding method produces 
hemisphere axes that match the actual axes defined by 
the two parent particles. The separation is shown in units 
of AR, defined as 

AR = v/(A?7)2 + (A0)2 . (4) 

The agreement is not overwhelmingly good, and is sub- 
stantially worse than that obtained for ti events, as 
shown in Figure [TH We have checked that all 6 hemi- 
sphere algorithms produce very similar results. Since the 
agreement is better for the leading hemisphere, all of our 
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single hemisphere derived observables are based just on 
the leading hemisphere. 

We define three inclusive counts based on comparing 
the two reconstructed hemispheres: 

• N(Hemj) the number of events for which the recon- 
structed object multiplicity (jets + muons) in the 
two hemispheres differs by at least j, j=l,2,3. 

Once the two hemispheres are identified, we can break 
the degeneracy among the models by looking at the 
topology in the events. For a given mass of the parent 
particles, the events will look more jet-like rather than 
isotropic if the decays are two-body rather than multi- 
body cascades. In the case of jet-like events, the projec- 
tion of the observed track trajectories on the transverse 
plane will cluster along the transverse boost of the par- 
ticles generating the cascade. 

In order to quantify this behaviour, we start from the 
central axes of each hemisphere and draw slices in the 
transverse plane with increasing opening angles in (j) sym- 
metric around the hemisphere axis. We refer to the slices 
as cones, reminiscent of the cones used in CLEO analy- 
ses [T^I to discriminate between jet-like QCD background 
and isotropic decays of B meson pairs. 

We build five cones of opening angle 2a {a = 30°, 45°, 
60°, 75° and 90°) in each hemisphere. In terms of these 
cones we define variables: 

• N(nt-ca), the number of events having at least n 
tracks (n=10,20,30,40) in the leading hemisphere 
cone of opening angle 2a. 

• N(ntdiff-CQ!), the number of events having a differ- 
ence of at least n tracks (n=10,20,30,40) between 
the cones of opening angle 2a in each hemisphere. 

Tracks are counted if they have pt > 1 GeV/c and \ri\ < 
2.4. Since the cone of opening angle 2a includes the one 
of opening angle 2(3 for a > /3, these variables have an 
inclusive nature. 
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FIG. 15: The distribution of the AR separation between the 
rj-<j) direction of the parent superpartner and the reconstructed 
hemisphere axis. This is from 24,667 events of model LM5 
passing our selection. The solid red line is for the leading 
hemisphere, while the dashed blue line is for the second hemi- 
sphere. 
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E. The stransverse mass mT2 

A potentially powerful observable for model discrimi- 
nation and mass extraction is the stransverse mass vari- 
able mT2 [zll-lzl]. Let us briefly review how this is sup- 
posed to work for our missing energy signal. Ignoring 
events with neutrinos, our signal events have two heavy 
parent particles of mass nip , each of which contributes to 
the final state a WIMP of mass nidm plus some number 
of visible particles. Supposing also that we have a per- 
fect hemisphere separation, we can reconstruct each set 
of visible particles into a 4- vector p^. If we also knew 
the mass and the pt of each WIMP, we could reconstruct 
a transverse mass for each hemisphere from the formula 



2{E, 



X 771dm 

H/rp 



Pt 



(5) 



FIG. 16: The distribution of the Ai? separation between the 
r]-(f) direction of the parent top quark and the reconstructed 
hemisphere axis. This is from 3,000,000 Pythia ti events with 
no selection. The solid red line is for the leading hemisphere, 
while the dashed blue line is for the second hemisphere. 



This transverse mass is always less than or equal to the 
mass rUp of the parent particle. Thus the largest of the 
two transverse masses per event is also a lower bound on 

Of course we do not know the px of each WIMP, only 
the combined E^^^^. Let p^^ and p^^ denote a possible 
decomposition of the total p™"^** into two azimuthal vec- 
tors, one for each WIMP. Note that this decomposition 
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ignores initial state radiation, the underlying event and 
detector effects. Then we can define the stransverse mass 
of an event as 



m^2(TOdm) = 



(6) 



max 



''T2 



/ W\ 2 / (2)^■ 



Since (with the caveats above) one of these partitions is 
in fact the correct one, this quantity is also a lower bound 
on the parent mass mp. 

For a large enough data sample, with the caveats above 
and ignoring finite decay widths, the upper endpoint of 
the stransverse mass distribution saturates at the par- 
ent mass TTip, provided we somehow manage to input the 
correct value of the WIMP mass Wdm- In the approxi- 
mation that the invariant mass mx of the visible decay 
products is small, the lower endpoint of the stransverse 
mass distribution is at rridm- 

These impressive results seem to require that we know 
a priori the correct input value for mdm- However it has 
been shown [7^-[7^ that in principle there is a kink in 
the plot of the upper endpoint value of tot2 as a function 
of the assumed rridm, precisely when the input value of 
JTidm equals its true value. Thus it may be possible to 
extract both nip and mdm simultaneously. 

Summarizing the remaining caveats, the stransverse 
mass method requires a large data sample, and is polluted 
by effects from incorrect hemisphere separation, ISR, the 
underlying event, finite decay widths and detector effects. 
We can compare this to the precision extraction of the 
W mass at CDF [80|, HH , using the transverse mass dis- 
tribution of the charged lepton and neutrino from the 
W decay. Here the WIMP mass is essentially zero, the 
data samples are huge, and hemisphere separation is not 
applicable since there is only one WIMP. In the CDF 
analysis the ISR uncertainty was traded for an FSR un- 
certainty, by interpreting the vector sum of all the calori- 
metric Et not associated with the charged lepton as com- 
ing from ISR plus the underlying event; this then pollutes 
the reconstructed neutrino pT with final state radiation 
from the lepton. The measured transverse mass distribu- 
tion has a considerable tail extending above the putative 
endpoint at mw-, but a precision mass (and width) ex- 
traction is still possible by modeling the distribution. 

In (s^ it was shown that an imperfect hemisphere sep- 
aration greatly degrades the mT2 distribution for simu- 
lated LHC SUSY events. Two approaches to solve this 
problem have been suggested. The first, used in [s^ . 
is to reject events when the total invariant mass of the 
reconstructed objects in either hemisphere exceeds some 
value. This strategy is based on the fact that for a correct 
hemisphere separation with mT2 near the endpoint the 
hemisphere invariant mass is small, while incorrect hemi- 
sphere assignments naturally lead to large hemisphere in- 
variant masses. The second approach, used in (83|, is to 
replace mT2 with a new variable rriTGen- The new vari- 
able minimizes not only over all partitions of the p™"^** 
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FIG. 17: Comparison of tlie stransverse mass mT2 distribu- 
tions for model CS6, varying the cut on the maximum liemi- 
sphere invariant mass. The solid red line shows the mT2 dis- 
tribution when no cut is applied. The dashed blue line shows 
the mT2 distribution rejecting events when the total recon- 
structed invariant mass of either hemisphere exceeds 300 GeV. 
The dotted magenta line shows the mT2 distribution when 
this cut is lowered to 200 GeV. In each case 100,000 events 
were generated then rescaled to 1000 pb~^. The dotted hor- 
izontal lines are to guide the eye for where the distribution 
cuts off for 100 pb"^ and 1000 pb"^ 



into pi^"^ and p^\ but also over all possible hemisphere 
assignments. Since one of the hemisphere separations is 
in fact the correct one, rnxGen has the same endpoint as 
mT2 would have with a perfect hemisphere separation. 

Figure [T7] shows the degradation of the mT2 distribu- 
tion for our model CS6. The 771^2 endpoint should be 
at 589 GeV, the value of the gluino mass in CS6. How- 
ever the solid red line shows that a large fraction of events 
are above this endpoint, due to the irnperfect hemisphere 
separation. Applying the strategy of [84I, the dotted ma- 
genta line shows that we regain the correct endpoint by 
making a hard cut of 200 GeV on the maximum recon- 
structed hemisphere invariant mass. However with such 
a high requirement we take a big hit in statistics. The 
dashed blue line shows that we still do pretty well with 
a 300 GeV requirement, while gaining a lot in statistics. 
For this study we have used the 300 GeV requirement 
in all of our mT2 analysis. The value is unoptimized 
but also unbiased, since it was determined by asking for 
approximately the most stringent cut that retains rea- 
sonable statistics for the 771^2 distributions of the entire 
set of models considered. 

Figure [T51 shows a comparison of the 771^2 distributions 
for two of our Group 1 look-alike models, LM2p and CS6. 
For LM2p, the parents are gluinos of mass 856 GeV and 
squarks of mass approximately 800 GeV; for model CS6, 
by contrast, the parents are gluinos of mass 589 GeV. In 
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FIG. 18: Comparison of the stransverse mass mT2 distribu- 
tions for look-alike models LM2p (solid red line) and CS6 
(dashed blue line). For each model 100,000 events were gen- 
erated then rescaled to 1000 pb~^. The dotted horizontal 
lines are to guide the eye for where the distribution cuts off 
for 100 pb-i and 1000 pb'^ 



each case we have input the correct LSP mass, 142 GeV 
for LM2p and 171 GeV for CS6. 

Each plot is rescaled to 1000 pb~^. With this many 
events notice that we are just starting to saturate the 
appropriate endpoints at mT2 = Wp, and notice the onset 
of tails above the endpoints. The dotted lines in the 
figure guide the eye to where the distributions cut off for 
data samples of 100 pb~^ and 1000 pb~^. Obviously for 
100 pb""'^ we are not close to populating the endpoints. 

However even for 100 pb~^ there are significant differ- 
ences between the mT2 distributions of the two models. 
These differences only become larger if we use the same 
input mass for the LSP. Thus mT2 is at least as interest- 
ing for look-alike discrimination as the more traditional 
kinematic variables discussed above. Furthermore, even 
if we are not close to populating the endpoint, it might 
be possible to extract a direct estimate of Wp by fitting 
or extrapolating the distributions. 

For our study we define five bins and four new inclusive 
counts from mT2'- 

• N(mT2-300) the number of events after selection 
with mT2 > 300 GeV/c2, 

• N(mT2-400) the number of events after selection 
with mT2 > 400 GeV/c^, 

• N(mT2-500) the number of events after selection 
with mT2 > 500 GeV/c^. 

• N(mT2-600) the number of events after selection 
with mT2 > 600 GeV/c^. 



When comparing a model Ml, playing the role of the 
data, with a model M2, playing the role of the model to 
test, we will use the mass of the WIMP in model M2 as 
the input mass in calculating mT2 for both models. 



F. Flavor enrichment 

In order to have some model discrimination based on 
the r or 6 content, we need simple algorithms to create 
subsamples enriched with b quarks and r's. We refer to 
these algorithms as "tagging" , despite the fact that the 
tagging efficiencies and the purity of the subsamples are 
rather poor. 

Without attempting any detailed optimizations, we 
have designed two very simple tagging algorithms. We 
expect these algorithms to be robust, since they only re- 
quire a knowledge of uncorrected high Et jets, high pT 
muons, and basic counting of high px tracks inside jets. 

T enrichment: For each jet we define a 0.375 cone 
centered around the jet axis. Inside this cone we count 
all reconstructed charged tracks with px > 2 GeV/c. If 
only one such track is found, and if this track has pt > 15 
GeV/c, we tag the jet as a r jet. 

The T algorithm is based on single-prong hadronic t 
decays, which as their name implies produce a single 
charged track. In addition, leptonic decays of a r to 
an electron and two neutrinos can be tagged, since some 
fraction of electrons reconstruct as jets. Soft tracks with 
Pt < 2 GeV/c are not counted, a fact that makes the 
algorithm much more robust. The px > 15 GeV/c re- 
quirement on the single track reduces the background 
from non-r jets. Increasing the cone size decreases the 
efficiency to tag genuine r's, because stray tracks are 
more likely to be inside the cone; decreasing the cone 
size increases the fake rate. A genuine optimization of 
this algorithm can only be done with the real data. 

Table [XlXl shows the results of applying our r tagging 
algorithm to simulations of the Group 1 models LM2p, 
LM5, LM8, CS4d and CS6. The efficiency, defined as the 
number of r tags divided by the number of generator- level 
t's that end up reconstructed as jets, varies between 12% 
and 21%. The efficiency is lowest for models LM8 and 
GS4d, models where r's come entirely from W and Z 
decays. The efficiency is highest for model LM2p, which 
has a large final state multiplicity of r's from decays of 
charginos, second neutralinos and staus. 

The purity, defined as the fraction of r tagged jets that 
actually correspond to generator-level r's, is quite low for 
models LM8 and CS4d, and is only 8% for model CS6, 
which contains very few r's. We obtain a reasonably high 
purity of 55% for LM2p, the model with by far the largest 
r multiplicity. 

We conclude that it is possible to obtain significantly 
enriched samples of r's from our simple algorithm, but 
only for models that do have a high multiplicity of ener- 
getic r's to begin with. From the counts in Table IXiXl it 
is clear that this tagging method is not viable with 100 
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pb ^ of integrated luminosity. 

TABLE XIX: Results of our r tagging algorithm applied 
to the Group 1 models LM2p, LM5, LM8, CS4d and CS6. 
Counts are rescaled to 1000 pb~^ from 100,000 events per 
model. The listing for r jets counts generator level r's that 
are reconstructed as jets in events that pass our selection. 



LM2p LM5 LM8 CS4d CS6 


r jets per fb^^ 


409 144 171 


112 34 


tags per fb^^ 


157 110 122 


102 59 


correct 






tags per fb~^ 


86 25 21 


14 5 


efficiency 


21% 18% 12% 


13% 16% 


purity 


55% 23% 17% 


14% 8% 



b enrichment: For each jet we search for a recon- 
structed muon inside the jet (recall that our muons have 
Pt > 20 GeV/c and \t]\ < 2.4). If a muon is found within 
AR < 0.2 of the jet axis we tag it as a 6 jet. 

This b algorithm is based on tagging muons from 
semileptonic B decays inside the 6 jet. This is inspired by 
the "soft muon" tagging that was used in the top quark 
discovery at the Tevatron [H, HH] ■ In our case "soft" is 
a misnomer, since in fact we only count reconstructed 
muons with pT > 20 GeV/c. This requirement makes 
the tagging algorithm more robust, but reduces the effi- 
ciency. 

Table IXXI shows the results of applying our b tagging 
algorithm to simulations of the Group 1 models LM2p, 
LM5, LM8, CS4d and CS6. The tagging efHciency is de- 
fined as the number of b tags divided by the number of 
generator-level 5's that are within AR < 0.3 of the center 
of a reconstructed jet. Although all of these models have 
a high multiplicity of generator-level &'s, the tagging ef- 
ficiency is poor: only about 5% for all models. However 
the purity of the samples is rather good: above 70% for 
every model except CS6. 

We conclude that it is possible to obtain significantly 
enriched samples of 6's from our simple algorithm, but 
with low efficiency. From the counts in Table IXXi it 
is clear that this tagging method is not viable with 100 
pb^^ of integrated luminosity, but should become useful 
as we approach 1000 pb^^. 

In our study, discrimination based on r's and 6's is ob- 
tained from ratios that involve the following two inclusive 
counts: 

• N(T-tag), the number of events after selection hav- 
ing at least one t tag. 

• N(6-tag), the number of events after selection hav- 
ing at least one soft muon b tag. 



VI. THE LOOK-ALIKE ANALYSIS 

The look-alike analysis proceeds in four steps: 



TABLE XX: Results of our 6 tagging algorithm applied to the 
Group 1 models LM2p, LM5, LM8, CS4d and CS6. Counts 
are rescaled to 1000 pb~^ from 100,000 events per model. The 
listing for b jets counts generator level 6 quarks matched to 
reconstructed jets that pass our selection. 



LM2p LM5 LM8 CS4d CS6 


b jets per fb~^ 


1547 1693 2481 


1596 748 


tags per fb~^ 


115 112 148 


105 106 


correct 






tags per fb~^ 


82 81 112 


75 41 


efficiency 


5% 5% 5% 


5% 5% 


purity 


72% 72% 75% 


71% 39% 



1. We choose one of the models to play the part of 
the data. We run the inclusive E^^^^+jets analysis on 
the MET trigger and verify that the predicted yield es- 
tablishes an excess (at > 5(t) above the SM background 
with 100 pb~^. We call the number of events selected 
in this way the observed yield Ndata- In what follows, 
we assume that a subtraction of the residual Standard 
Model background has already been performed. We as- 
sume large signal over background ratios for the models 
considered so that the statistical error on the background 
has a small impact on the total error. 

2. We identify a set of models giving a predicted yield 
N compatible with A^data- The compatibility is estab- 
lished if the difference in the two counts is less than twice 
the total error, i.e if the pull 

liVdata - N\ 

is smaller that two. In the formula a{N) represents the 
error associated to the expected number of events N. We 
calculate it as the sum in quadrature of several contribu- 
tions: 

• A Poissonian error which takes into account the 
statistical fiuctuations associated to the event pro- 
duction (statistical component of the experimental 
error). 

• An error associated to the detector effects (system- 
atic component of the experimental error). 

• Theoretical error on the predicted number of events 
N (including a statistical and a systematic compo- 
nent) . 

We discuss the origin of each contribution below. 

3. For each additional observable N'' previously listed, 
we consider the value on the data (^jata) ^'^'^ pre- 
dicted value Nj for the model j. We calculate the pull 
as in eqn. [7] and we identify the variable with the largest 
pull as the best discriminating counting variable. We ig- 
nore all the variables for which both the model and the 
data give a yield below a fixed threshold A^min- We use 
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■^min = 10, i.e. we require a minimum yield that is more 
than three times its Poisson error \J iV''; for the data this 
corresponds to excluding at Scr the possibility that the 
observed yield is generated by a fluctuation of the back- 
ground. 

4. We form ratios of some of the observables used 
above and we repeat the procedure of step 3. Since part 
of the uncertainties cancel out in the ratio, these variables 
allow a better discrimination than the counting variables. 
In addition, provided that the two variables defining the 
ratio are above the threshold N-cm-cn the ratios of two cor- 
related variables (such as N{4:j)/N{3j)) are less sensitive 
to the statistical fluctuations. Details on the calculation 
of the errors on the ratios are given below. 

In each of the four trigger boxes we define the following 
ratios of correlated inclusive counts: 

• r("-j)(3j), with n=4,5 

• r(MET320) 

• r(MET420) 

• r(MET520) 

• r(HT900) 

• r(MeffI400) 

• r(M1400) 

• r(M1800) 

• r(IIemj) with j— 1,2,3 

• r(2/i-nj)(l/i-nj) with n=3,4 

• r(r-tag) 

• r(fe-tag) 

• r(mT2-300) with the theory LSP mass 

• r(mT2-400) with the theory LSP mass 

• r(mT2-500) with the theory LSP mass 

• r(mT2-600) with the theory LSP mass 

• r(mT2-400/300) with the theory LSP mass 

• r(mT2-500/300) with the theory LSP mass 

• r(mT2-600/300) with the theory LSP mass 

• r{nt-ca) for n=10,20,30,40 and a = 30°,45°, 60°, 
75°, 90° 

• r(ntdiff-ca) for for n=10,20,30,40 and a = 30°, 45°, 
60°, 75°, 90° 



For most of these ratios the numerator is the correspond- 
ing count defined in Sect ion fVl and the denominator is the 
total event count in the trigger box. The exceptions are 
r(nj)(3j)=N(nj)/N(3j), r(2/i-nj)(lAi-nj)=N(2A*-nj)/N(l/i- 
nj), and r(mT2-n/300)=N(mT2-n)/N(mT2-300), n ^ 
400, 500, 600. We also use the ratios of the counts in 
the DiJet, TriJet and Muon20 boxes to the count in the 
MET box: 

• r(DiJet) 

• r(TriJet) 

• r(Muon20) 

As mentioned previously, it turns out that the DiJet, 
TriJet and Muon20 boxes are subsamples of the MET 
box to an excellent approximation, thus these ratios are 
also ratios of inclusive counts. 

Finally we iterate and perform the transpose compar- 
isons (the model that was considered as data takes the 
role of the model). 

A. Theoretical uncertainty 

We take into account several sources of uncertainty. 
First of all, there is an error associated to the knowledge 
of the parton probability density functions (pdfs) that 
are used to generate the event samples. In order to eval- 
uate this error, we produce and ana.lyze all samples with 
three different sets of pdfs: CTEQSL M, CTEQ6M and 
MRST2004nlo [13 or MRST2002nlo [H] for Group 1 and 
Group 2 respectively. We quote as central value the aver- 
age of the three values; for the pdf uncertainty we crudely 
estimate it by taking half the spread of the three values. 
This uncertainty, as we will show, has important effects 
on the results. 

An additional error is given by the relative QCD scale 
uncertainty when we compare different look-alike mod- 
els. This is an overall systematic on the relative cross 
sections that we take to be 5%. It is actually larger than 
this in our study, at least for the Group 2 models where 
we use LO cross sections, but we are assuming some im- 
provement by the time of the real discovery. 

There is an additional uncertainty for each observable 
from the missing higher order matrix elements. It is not 
included in the analysis shown here. It could be included 
crudely by running Pythia with different values of the 
ISR scale controlled by MSTP(68), similar to how we 
evaluate the pdf uncertainties. A better way is to include, 
for the signals, the higher order matrix elements for the 
emission of extra hard jets. The ideal approach would be 
a full NLO generator for the signals. 

The sum in quadrature of all these effects gives the 
systematic error associated to the theoretical prediction. 
In the case of ratios, the error on the cross section cancels 
out. In a similar way, the correlated error on the pdfs 
cancels out by calculating the ratios for the three sets of 
pdfs and then averaging them. 
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In the case of mSUGRA models, the result of the sim- 
ulation also depends on which RGE evolution code we 
use^^ to go from the parameters at the high scale to the 
SUSY spectrum at the Terascale. Rather than including 
an error associated to such differences we take one of the 
codes (Isajet v7.69 or SuSpect v2 . 34) as part of the 
definition of the theory model we arc considering. 

The theory predictions are also affected by a statistical 
error, related to the fact that the value of each observ- 
able is evaluated on a sample of limited size. Generating 
the same sample with a different Monte Carlo seed one 
obtains differences on the predicted values of the observ- 
able. The differences, related to statistical fluctuations, 
are smaller for larger generated data sets. Considering 
that each number of events N^^ for observable i and model 
j can be written as = ejx and that the error on 
is already accounted for in the systematic contribution to 
the theoretical error, the efficiency ej has an associated 
binomial error: 



GEN 



(8) 



where iVcEN is the size of the generated sample before 
any selection requirement. This error can be made negli- 
gible by generating data sets with large values of A'^gen- 
We include the contribution of the statistical error sum- 
ming it in quadrature to the systematic error. 

When the variables defining the ratio are uncorrelated, 
the error on the ratio is obtained by propagating the 
errors on the numerator and denominator, according to 
the relation 



cr(r) 



iVdc 



cnj 



(9) 



where r = iVnum/A^don- 

This is not the correct formula in our case, since all 
of the counts on our ratios are correlated. For instance, 
N{4j) and N{3j) are correlated, since all the events with 
at least four jets have also three jets. Only a fraction 
of the events defining -/V(3j) will satisfy the requirement 
of an additional jet, i.e. applying the requirement of an 
additional jet on the > 3 jets sample corresponds to a bi- 
nomial process, with the ratio ^■(4j)(3j) = N{4j)/N{3j) 
the associated efficiency. The error on r is then given 
by eqn. [51 replacing ej with the r and iVcEN with N{4:j). 
The same consideration applies to all the ratios built from 
correlated variables. In order to use eqn. [5] for the error, 
we always define the ratios such that they are in the range 
[0,1]. 



For the CMS benchmark models, we used Isajet v7 . 69 but com- 
pared the spectra results with SuSpect v2 . 34 + SUSY-HIT v . 1 . 1 
and SoftSusy v. 2. 0.14 [88l |. The differences in the computed 
spectra led to differences in our observed yield of 3 to 10%. 



B. Statistical uncertainty 

The production of events of a given kind in a detector 
is a process ruled by Poisson statistics. The error on a 
counting variable N is given by ^/N. 

In analogy to the statistical error on the theoretical 
predictions, the statistical error on a ratio of two corre- 
lated variables is calculated according to eqn. [51 replacing 
ej with the r and iVcEN with the value of the denomi- 
nator variable for the reference data luminosity. Unlike 
the case of the theoretical error, this error is associated 
to the statistics of the data set and not to the size of the 
generated sample; this error is intrinsic to the experimen- 
tal scenario we are considering and cannot be decreased 
by generating larger Monte Carlo samples. 



C. Systematic uncertainty 

We consider two main sources of systematic error, the 
knowledge of the collected luminosity and the detector ef- 
fects on the counting variables. Estimating the two con- 
tributions to be of the order of 10%, we assign a global 
systematic error of 15% to the counting variables. When 
calculating the ratios, we expect this error to be strongly 
reduced. On the other hand the cancellation is not ex- 
act, and it will be less effective at the start-up, because 
of potentially poorly-understood features related to the 
reconstruction. Hence we associate a residual systematic 
error of 5% to the ratios. 



VII. SUMMARY OF GROUP 1 RESULTS 



Table IXXll summarizes the results from the five MSSM 
models of Group 1. There are 20 pairwise model com- 
parisons. One model is taken as the simulated data, with 
the observed yield scaled to either the 100 pb^^ discovery 
data set or a 1000 pb^^ follow-up. The actual number 
of events generated in each case was 100,000, thus the 
"data" has smaller fiuctuations than would be present in 
the real experiment; we are interested in identifying the 
best discriminators and their approximate significance, 
not simulating the real experiment. Note, however, that 
in our analysis we include the correct statistical uncer- 
tainty arising from the assumed 100 pb'^ or 1000 pb~^ of 
integrated luminosity in the "data" sample, as described 
in Section [Vll B. When the rescaled counts are below a 
minimum value, the corresponding ratio is not used in 
our analysis, as described in step 3 of analysis procedure 
outlined in Section [VH 

Given a "data" model, we can compare it to four the- 
ory models. We want to understand to what extent we 
can reject each theory model, based on discrepancies in 
our discriminating ratios. With the real LHC data, the 
test needs to be performed as follows: given Model A and 
Model B, we will ask if Model A is a better description of 
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TABLE XXI: Summary of the best discriminating ratios for model comparisons in Group 1. The models listed in rows are taken 
as simulated data, with either 100 or 1000 pb^^ of integrated luminosity assumed, and uncertainties as described in the text. 
The models listed in columns are then compared pairwise with the "data" . In each case, the three best distinct discriminating 

ratios are shown, with the estimated significance. By distinct we mean that we only fist the best ratio of each type; thus if 
r(5j)(4j) is listed, then r(4j)(3j) is not, etc. The asterix on the ratio r(b-tag) indicates that it is defined in the Muon20 box; all 
other ratios are defined in the MET box, and r(Muon20) denotes the ratio of the number of events in the Muori20 box to the 
number in the MET box. The mT2 ratios are computed using the LSP mass of the relevant "theory" model, not the "data" 
model. 
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the data than Model B; this is a properly posed hypothe- 
sis test. Every time we reject a theory model as an expla- 
nation of the "data" , we learn something about the true 
properties of the model underlying the "data". When 
several discriminating ratios have high significance, we 
may learn more than one qualitative feature of the un- 
derlying model from a single pairwise comparison; this 



is not always the case however, since many ratios probe 
very similar features of the models and are thus highly 
correlated. 

In general the discriminating power of the robust ratios 

is quite impressive. For 8 of the pairwise comparisons 
at least one ratio discriminates at better than 5(7 with 
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only 100 pb ^ of simulated data"'^^. In only three cases 
(discussed in more detail below) do we fail to discriminate 
by at least 5a with 1000 pb'^ In 14 out of 20 cases 1000 
pb~^ gives > 5(7 discrimination with 3 or more different 
ratios, giving multiple clues about the underlying data 
model. 
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FIG. 19; Comparison of the stransverse mass mT2 distribu- 
tions for look-alike models LM5 (solid red line) and CS4d 
(dashed blue line). For each model 100,000 events were gen- 
erated then rescaled to 1000 pb~^. 



A. LM5 vs CS4d 

We begin with one of the simplest pairwise compar- 
isons: LM5 is treated as the data and compared to the- 
ory model CS4d. Averaging over the three pdfs used in 
our study, we find that LM5 would produce 1951 sig- 
nal events in the 100 pb~^ discovery sample. This is 
only 7% more than the 1817 events predicted by theory 
model CS4d, so the data model and theory model are 
indeed look-alikes. If we peek at the features of the two 
models we see that they have a number of phenomeno- 
logical similarities. Both models have about the same 
proportion of squark-gluino and squark-squark produc- 
tion. In CS4d gluinos decay to stop-top, followed by 
the three-body stop decay ii bW~^Xi; this resembles 
LM5 gluinos decaying to left-squark-quark, followed by 
left-squark cascade to chargino-quark, with the chargino 
decaying to li^x?. Both models have a large fraction of 
events with W^s. 

At the moment of discovery CS4d is excluded as 
an explanation of the LM5 "data" by more than 3a 



As is common practice in high energy physics, we take 5cr and 
3(T as reference values for discovery and evidence, repsectively. 



in three kinematic ratios: r(IIT900), r(Meffl400) and 
r(MET520). These ratios discriminate based on the pro- 
portion of highly energetic events; their values are about 
50% larger for LM5 than for CS4d. This indicates that 
the LM5 signal arises from production of heavier par- 
ent particles. From the superpartner spectra in Figure [5] 
we see that indeed the gluino mass is about 100 GeV 
heavier in LM5 than in CS4d, and the lightest squarks 
are also somewhat heavier. Note that LM5 has a harder 
E!p^^^ distribution even though its LSP mass is ~ 100 
GeV lighter than that of CS4d. 

Since the mT2 endpoint is a direct measure of the 
(largest) parent particle mass, we would expect the mT2 
ratios to be good discriminators. However as can be 
seen in Figure \W\ with 100 pb~^ we are hampered by 
poor statistics near the endpoint. The best mT2 ratio 
is r(mT2-6Q0/3Q0) in the MET box, computed with the 
LSP mass of CS4d; it is defined as the number of events 
in the MET box with mT2 > 600 GeV divided by the 
number of events with mT2 > 300 GeV. This ratio has 
2.8a significance with 100 pb^^. 

Making the same comparison at 1000 pb~^, the sig- 
nificance of r(MET520), r(HT900) and r(Meffl400) as 
discriminators improves to 7.1, 6.4 and 5.9cr respec- 
tively. More importantly, two of the mT2 ratios, r(mT2- 
600/300) and r(mT2-600/400), now discriminate at bet- 
ter than 6a. We have not attempted to perform a direct 
extraction of the endpoint, but it is clear that the tot2 
ratios can compete with the kinematic ratios as discrimi- 
nators while simultaneously providing more direct infor- 
mation about the underlying 2 — s- 2 parent subprocess. 

Although we can reject CS4d conclusively, some qual- 
itative differences between LM5 and CS4d are not re- 
solved. Model CS4d produces more tops, while LM5 pro- 
duces a lot of Higgs; however Table IXXI shows that the 
number of b jets, b tags and tagged b jets are all about the 
same. Lacking an explicit reconstruction of tops or Higgs, 
we do not discriminate these models based on these fea- 
tures. 



B. LM2p vs LM5 

This is the most difficult pair of look-alikes in our 
study. From Figure [5] we see that the superpartner spec- 
tra are almost identical; the only significant difference is 
that LM2p has a much lighter stau. As a result, LM2p 
events are much more likely to contain t's, while LM5 
events are much more likely to contain W^'s (mostly from 
chargino decays). 

As Table IXXll shows, at the moment of discovery LM5 
cannot be ruled out as the explanation of LM2p "data" . 
Without r tagging, we would not have 3a discrimination 
even with 1000 pb~^; the best we could do is the jet 
multiplicity ratio r(5j)(3j) with 2.8a: LM5 produces more 
high multiplicity jet events after selection, because we can 
get two jets from a W decay in LM5 compared to only 
one hadronic r from a stau decay in LM2p. 
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TABLE XXII: Summary of the best discriminating ratios for model comparisons in Group 2. The models listed in rows are 
taken as simulated data, with either 100 or 1000 pb~^ of integrated luminosity assumed, and uncertainties as described in the 
text. The models listed in columns are then compared pairwise with the "data". In each case, the three(five) best distinct 
discriminating ratios for 100(1000) pb~^ are shown, with the estimated significance. By distinct we mean that we only list the 
best ratio of each type; thus if r(5j)(4j) is listed, then r(4j)(3j) is not, etc. Square brackets denote ratios defined in the DiJet, 
TriJet or Muon20 boxes; all other ratios are defined in the MET box, and r(DiJet), r(TriJet) denotes the ratio of the number 
of events in the DiJet/TriJet boxes to the number in the MET box. The mT2 ratios are computed using the LSP mass of the 
relevant "theory" model, not the "data" model. 
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With our crude t tagging algorithm we manage to dis- 
criminate LM2p and LM5 at 3.1cr with 1000 pb"^ As 
seen in Table IXIXl the LM2p sample has almost three 
times as many hadronic r's that reconstruct as jets than 
does the LM5 sample. This results in four times as many 
jets being correctly tagged as hadronic r's, and almost 
50% more total r tags for LM2p versus LM5. 

The only other ratios that give even 2a discrimina- 
tion are based on the stransverse mass 771^2- The ratio 
r(mT2-400) counts the number of events with mT2 > 400 
GeV divided by the total number of events after selection; 
here mT2 is computed using the LSP mass of the theory 
model LM5. Figure [20l compares the tot2 distributions 
of LM2p and LM5. The shapes of the distributions are 
very similar; the only obvious difference is that there are 
fewer events in the LM5 bins. 



It is not surprising that the shapes are similar, since 
LM2p and LM5 are very similar models with nearly iden- 
tical gluino and squark masses. Thus ratios like r(mT2- 
600/400) should not be good discriminators, and indeed 
they are not. The ratio r(mT2-400), which is a good 
discriminator, is obviously just picking up the fact that 
there are fewer events in the LM5 bins than in the LM2p 
bins. 

Figure [21] compares the 771^2 distributions of LM2p 
subsamples with fixed multiplicity of jets+muons. We 
see that events with higher multiplicity are significantly 
more likely to fail the 300 GeV hemisphere invariant mass 
upper bound that we imposed to make up for the effect 
on mT2 of imperfect hemisphere separation. This makes 
sense since events with higher multiplicity are more likely 
to have mistakes in the hemisphere assignments. 
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FIG. 20: Comparison of the stransverse mass mT2 distribu- 
tions for look-alike models LM2p (solid red line) and LM5 
(dashed blue line). For each model 100,000 events were gen- 
erated then rescaled to 1000 pb~^. 



FIG. 22: Comparison of the stransverse mass mT2 distribu- 
tions for two subsamples of model LM2p: events with neu- 
trinos (solid red line) and events without neutrinos (dashed 
blue line). A total of 100,000 events were generated, rescaled 
to 1000 pb~^, then sorted into the two subsamples. 
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FIG. 21: Comparison of the stransverse mass mT2 distribu- 
tions for model LM2p with a fixed number of reconstructed 
objects (jets or muons): 3 objects (solid red line), 4 objects 
(dashed blue line) and 5 objects (dotted magenta line). For 
each case 100,000 events were generated then rescaled to 1000 
pb-^ 



Thus the discriminating power of r(mT2-400) in this 
case is correlated with r(5j)(3j), not with kinematic ratios 
hke r(HT900) and r(Mefri400). 

It is important to note that the mT2 ratios have some 
ability to discriminate based on neutrinos in the final 
state: Figure [H] shows a comparison of the mT2 dis- 
tributions for LM2p events containing neutrinos versus 
those without neutrinos. The events with neutrinos have 



a softer mT2 distribution, i.e. the subsample with neutri- 
nos is less efficient at populating the tot2 upper cndpoint. 
Models LM2p and LM5 differ greatly in the proportion 
of events after selection that have neutrinos: about 50% 
for LM2p but only about 10% for LM5. The neutrino 
content effect on the mr2 distributions actually reduces 
the discrimination of LM2p versus LM5, because the neu- 
trino effect works in the opposite direction from the dom- 
inant effect of jet multiplicity. 

This example shows that the interpretation of the 
mT2 ratios requires a comparison with other discrimi- 
nators. If the mT2 ratios r(mT2-xxx/yyy) have a high 
significance positively correlated with e.g. r(HT900) and 
r(Meffl400), then the tot2 ratios are predominantly in- 
dicating kinematics. If the mT2 ratios r(mt2-xxx) have 
a high significance but r(mT2-xxx/yyy) do not (as oc- 
curred here), we expect they will be positively corre- 
lated with the jet ratios, indicating a difference in the 
multiplicity of reconstructed objects. If the tot2 ratios 
r(mT2-xxx/yyy) have a high significance uncorrelated or 
negatively correlated with either kinematics or jet mul- 
tiplicity, this could signal the presence of three unseen 
particles (e.g. two LSPs and a neutrino) in the final state 
of a large fraction of events. 

C. CS4d vs LM8 

This is the second most difficult pair of look-alikes in 
our study. From Figure [5] we see that the gluino and 
squark superpartner spectra are roughly similar. The 
gluino masses agree to within 10 GeV; in LM8 the left 
and right squarks are nearly degenerate and slightly heav- 
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ier than the gluino, while in CS4d the right squarks are 
slightly lighter than the gluino and the left-squarks are 
about 180 GeV heavier. 

Both models have about the same fractions of squark 
pair, squark-gluino and gluino pair production after se- 
lection. In both models the gluino decays predominately 
to top-stop. Both models produce many tops, 6's and 
VF's. LM8 also produces a lot of Z''s from decays. 

The main difference between these models is that CS4d 
has a much lighter stop and a much heavier LSP. With 
771(^=352 GeV and m^o=251 GeV a two-body hght stop 
decay cannot occur, and this stop can just barely manage 
the three-body decay ti bW^Xi- 

Consider the case that we perform our look-alike anal- 
ysis taking CS4d as the "data" model and LM8 as the 
theory model. At the moment of discovery, LM8 will 
explain not only the overall size of the signal but also 
its kinematics: with 100 pb~^ we find no kinematic ob- 
servable that discriminates better than r(Meffl400) with 
1.7(7, and even with 1000 pb~^ no kinematic observables 
discriminate at even the 3a level. Given that LM8 is 
an mSUGRA benchmark used at the LHC experiments 
it may be tempting to falsely conclude that LM8 is the 
probable explanation of the discovery! Since LM8 has 
230 GeV charginos, even a preliminary result in this di- 
rection could be used, for example, as a justification to 
start building a 500 GeV linear collider. Since the actual 
chargino mass of the underlying compressed SUSY model 
is 352 GeV, this would lead to embarrassment. 

Fortunately our look-alike analysis gives additional dis- 
criminating handles: 

• The ratio r(Muon20) has a 3.4ct significance with 
1000 pb^^, reflecting a larger fraction of recon- 
structed muons in LM8 events over CS4d. 



efficiency in the b tagging and bad luck in that this ratio 
happens to have a rather large uncertainty from the pdf 
spread. 

These handles exclude LM8 with reasonable confidence 
as the explanation of the "data" . They also give clues 
on how to modify LM8 (still within the hypothesis of 
SUSY) to better fit the data. The Z peak in LM8 comes 
from left-squarks decaying to ! making the left-squarks 
heavier will cause them to decay instead to quark-gluino. 
To keep the parent kinematics and observed yield con- 
stant, this also suggests lowering the right-squark masses. 
This has the added benefit of favoring the 2-body squark 
decay qn — qXi over the decay qn — > qg, which lowers 
the jet multiplicity. The large hemisphere asymmetries 
in LM8 are derived from gluino cascades resulting in two 
top quarks, and thus up to six jets, in a single hemi- 
sphere. The obvious way to reduce this without dras- 
tically changing the model is to squeeze out the phase 
space for the 2-body stop decay ti txi, and further 
squeeze the 3-body stop decay. By thus reducing the 
amount of visible energy reconstructed in the events, this 
reduces the hemisphere asymmetries, the jet multiplicity, 
and the muon counts. 

Thus with 1000 pb^^ we might not only exclude LM8 
but also come close to guessing CS4d from this simple 
look-alike comparison. Without the benefit of additional 
model comparisons this guess would be relying on the 
strong assumptions that the "data" was SUSY (an as- 
sumption that we will relax in the Group 2 analysis) and 
that the data was full of b's and tops despite lacking ex- 
plicit confirmation from b tagging or top reconstruction. 



• With 1000 pb^^ there are enough dimuons to re- 
construct the Z peak as shown in Figure [14] for 
LM8, while no peak appears for CS4d. 

• The ratio r(5j)(3j) in the MET box differs by 2.8cr 
at 100 pb~^, increasing to 4.2(T with 1000 pb^^, 
reflecting a larger jet multiplicity in LM8 events 
versus CS4d. 

• With 1000 pb~^ we also see discrepancies in the 
event shape variables. One of these, r(Hem3), is 
the fraction of events where the object counts in 
the two hemispheres differ by at least 3; the other, 
r(10tdiff-c300), is the fraction of events where the 
track count in a 30° cone around each hemisphere 
axis differs by at least 10. With 1000 ph'^ both 
these ratios have a significance of 3.6a, reflecting 
more symmetrical object counts in CS4d events 
than for LM8. 



K: "I Exp. statistical Error 



From Table IXXI we would have hoped for a significant 
difference in r(6-tag) between LM8 and CS4d. However 
even with 1000 pb"-'^ we obtain only a 2Aa significance 
with this ratio: this is due to the combination of low 
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FIG. 23: Breakdown of estimated uncertainties for discrimi- 
nating ratios with 1000 pb~^, in the comparison of look-alike 
models CS6 and CS4d, with CS6 treated as the "data". 
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FIG. 24: Breakdown of estimated uncertainties for discrimi- 
nating ratios with 1000 pb~^, in the comparison of look-ahke 
models CS6 and LM8, with CS6 treated as the "data". 



FIG. 26: Pdf spreads for discriminating ratios with 1000 pb^ , 
in the comparison of look-alike models CS6 and LM8, with 
CS6 treated as the "data". 
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FIG. 25: Pdf spreads for discriminating ratios with 1000 pb~^, 
in the comparison of look-alike models CS6 and CS4d, with 
CS6 treated as the "data". 



D. CS6 vs LM2p, LM5, LM8 and CS4d 

This is a complete example of how we could deduce the 
correct model, under the assumption of SUSY, based on 
how it fails to match with four incorrect SUSY models. 
We take CS6 as the "data" and compare it to LM2p, 
LM5, LM8 and CS4d. 

With 100 pb-\ the ratio r(mT2-500/300) is much 
smaller for the "data" than for LM2p, LM5, LM8 and 
CS4d, with significance 5.1, 4.6, 3.3 and 2.9a respectively. 
This indicates that the parent particle mass in the data 
model is quite a bit lighter than in model CS4d, which 
has the lightest squarks and gluinos of the four theory 



models. 

To keep the overall data yield constant, we can con- 
template two possible modifications of the spectrum in 
CS4d: either make the gluino lighter and the squarks 
heavier, or vice- versa. 

The other striking result with 100 pb~^ is that the 
"data" has only 1/4 as many events in the Muon20 box 
as the theory model LM8, and 1/3 as many as the theory 
model CS4d. At the same time, the "data" has nearly 
three times more b tags in the Muon20 box than does 
LM8, a 6.5(7 discrepancy. With a little more data the 
same puzzling discrepancy turns up in the comparison 
with CS4d. 

How to explain this? Recall that both models LM8 and 
CS4d produce a large number of 6's and W^s, and LM8 
also has large numbers of Z's. Events ending up in the 
Muon20 box have hard muons from either W/Z decays 
or from energetic cascades with semileptonic B decays in 
jets. If we removed all of the W's and Z's we would have 
many fewer events in the Muon20 box. However precisely 
in this case all of the events in the Muon20 box will have 
hard muons inside b jets. This would then explain a high 
b tagging rate in this box combined with a smaller overall 
count. 

Already with 100 pb~^ we have strong clues that the 
data model has a light gluino, heavy squarks, and that 
the gluino decays do not involve W^s, Z's or sleptons. 
Table 1X1X1 shows that the "data" has only 59 r tags 
per 1000 pb~^; this gives a 6a deficiency in t tags for 
the data relative to model LM2p after 1000 pb"^ The 
lack of T tags indicates we are not making a lot of r's, 
but if we are not making either W's or t's then we are 
probably not making charginos from gluino decays. This 
suggests that the mass splitting between the gluino and 
the chargino is relatively small, and that the gluino has 
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a three-body decay. A three-body decay mediated by a 
virtual chargino would imply more muons, so we a led 
to the three-body decay 5— > qqxi mediated by a virtual 
squark. 

Putting it all together (within the hypothesis of SUSY) 
leads to a model like CS6, with heavy squarks, a light 
gluino, and a compressed gaugino spectrum. Production 
is dominated by gluino pairs, and gluino decays are dom- 
inated by the three-body mode to qqXi- The only muons 
are from semileptonic B decays. 

This scenario also makes predictions that can be tested 
with more data. For example, the hemisphere counts 
should be quite symmetrical, since we are almost always 
producing a pair of the same particles with the same 
decays. Indeed this prediction is borne out with 1000 
pb~^, where both r(Hem2) and r(Hcm3) are > Ga smaller 
for CS6 than for LM8. 

These hemisphere ratios also demonstrate the impor- 
tance of the pdf uncertainties. Figures [23l and [24l show 
the breakdown of the uncertainties for some of the dis- 
criminating ratios with 1000 pb~^ in the comparisons of 
CS6 "data" with models CS4d and LM8. For CS4d the 
r(Hem2) ratio, which discriminates at 4.6cr, is systemat- 
ics limited, while the r(Hem3) ratio still has a rather 
large statistical uncertainty. For LM8 we notice that 
the theory systematic is the largest uncertainty for both 
r(Hem2) and r(Hem3). 

These differences are explained by Figures [53] and 
which show the spread in the values of the ratios as we 
vary the parton distribution functions used in the simula- 
tion. Note that the pdf spreads are model dependent: the 
spreads for the hemisphere ratios r(Hem2) and r(Hem3) 
are twice as large for LM8 as they are for CS4d. This 
explains why the total theory systematic for these ratios 
is larger for LM8 than for CS4d. However because of 
better statistics the hemisphere ratios appear to discrim- 
inate better for CS6 vs LM8 than they do for CS6 vs 
CS4d. The caveat is that the validity of this statement 
depends crucially on whether our estimates of the pdf 
uncertainties are at least roughly accurate. 



VIII. SUMMARY OF GROUP 2 RESULTS 

Table IXXIII summarizes the results from the models of 
Group 2. There are 6 pairwise model comparisons. One 
model is taken as the simulated data, with the number 
of signal events scaled to either the 100 pb^^ discovery 
data set or a 1000 pb~^ follow-up. In our analysis we 
include the correct statistical uncertainty arising from the 
assumed 100 pb^^ or 1000 pb^^ of integrated luminosity 
in the "data" sample, as described in Section IVTl B. 

The most remarkable feature of these results is that 
we achieve greater than 4(t discrimination of non-SUSY 
model LH2 from its SUSY look-ahkes NM4 and CS7, al- 
ready at the moment of discovery. With the larger 1000 
pb~^ data set, we achieve > 5a discrimination in every 
case, for more than five distinct ratios per comparison. 



Thus even with small data sets we can both distinguish 
SUSY and non-SUSY explanations of the same excess 
and have multiple handles to inform us about key prop- 
erties of the true model behind the data. 

To see how this works in detail, let us take LH2 as our 
"data" . Suppose that we lived in a world where particle 
theorists believed that any missing energy signal has to 
be explained by SUSY (until recently we did in fact live in 
such a world) . Then clever theorists might construct the 
SUSY model NM4 shown in Figure [S] as an explanation 
of the missing energy discovery. 

Applying our look-alike analysis, however, one detects 
a problem already with 100 pb~^. The mT2 ratio r(mT2- 
500) (computed using the NM4 LSP mass) is three times 
larger for the "data" than for the SUSY model NM4, 
a nearly 5a discrepancy. As seen in Table IXXIIIl this is 
positively correlated with the mT2 ratio r(mT2-500/300), 
which is more than twice as large for the "data" as for 
NM4, a 3(7 discrepancy, and uncorrelated with the jet 
multiplicity, which shows no significant difference. 

These results strongly suggest that the true model un- 
derlying the "data" has heavier parent particles than 
does SUSY model NM4. However the SUSY enthusiast 
will be quite confused by this conclusion, since the kine- 
matic ratio r(Meffl400) is more than two times greater 
for NM4 than it is for the "data" , a 3a discrepancy that 
seems to directly contradict our previous conclusion. A 
conservative SUSY enthusiast might choose to wait for 
more data, but this will only reinforce the confusion; with 
1000 pb-i the ratio r(mT2-500/300) indicates with 8.5cr 
significance that the NM4 squarks are too light, while 
r(Meffl400), r(M1400) and r(HT900) all indicate at > 5a 
that SUSY NM4 events are too energetic! 

At some point our SUSY enthusiast may decide to re- 
place SUSY model NM4 with SUSY model CS7. This is 
a bright idea since the parent production in CS7 is all 
gluinos, instead of all squarks as in NM4, and the gluino 
kinematics are naturally softer then squark kinematics, 
for comparable masses. Thus, as seen in Table IXXIVl 
CS7 matches both the kinematics and the overall yield 
of the "data" much better than NM4: even with 1000 
pb-i r(Meffl400), r(M1400) and r(HT900) have no dis- 
crepancies as large as 3a. 

However our SUSY enthusiast still has serious prob- 
lems. Table IXXIVl shows that now the E^''^^ distri- 
butions are way off: even with 100 pb~^ the ratio 
r(MET420) is more than twice as big for the "data" as 
for CS7, while r(MET520) is three times as big; these are 
both > 5a discrepancies. Furthermore the jet multiplic- 
ities don't match: the ratio r(4j)(3j) is almost twice as 
large for CS7 as for the "data" , a Aa discrepancy with 
100 ph-\ 

Figures [771 and demonstrate the robustness of these 
results, by showing the breakdown of the experimental 
and theoretical uncertainties for the relevant ratios. With 
the exception of r(4j)(3j), the uncertainties on all of the 
ratios that we have been discussing are completely dom- 
inated by the low statistics of our small "data" sample. 
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Thus, for example, doubling the pdf uncertainties would 
not alter any of the conclusions reached above. 

It is not obvious that our SUSY diehard can fix up a 
SUSY candidate to falsely explain the non-SUSY "data" , 
while surviving the scrutiny of our look-alike analysis. 
This applies even for small data sets on the order of a 
few hundred inverse picobarns. The key observation is 
that although SUSY models have many adjustable pa- 
rameters, the number of adjustable parameters relevant 
to this look-alike analysis is small compared to the num- 
ber of robust discriminators. 



LH2 vs. NM4 [100 pb^^ 



Variable 



LH2 NM4 Separation 



MET 



r(mT2-500) 


0.16 


0.05 


4.87 


r(mT2-400) 


0.44 


0.21 


4.84 


r(mT2-300) 


0.75 


0.54 


3.49 


r(Mefn400) 


0.11 


0.25 


2.99 


r(mT2-500/300) 


0.21 


0.09 


2.98 


r(M1400) 


0.07 


0.19 


2.69 


r(mT2-400/300) 


0.58 


0.40 


2.48 


r{HT900) 


0.13 


0.24 


2.34 


r(MET420) 


0.48 


0.37 


2.00 


r(mT2-500/400) 


0.36 


0.22 


1.47 



TABLE XXIII: Best discriminating ratios in the MET box, 
with separations in units of a, for the comparison of LH2 
vs. NM4, taking LH2 as the "data", assuming an integrated 
luminosity of 100 pb~^. 



LH2 vs. CS7 [100 pb' 



Variable 


LH2 CS7 Separation 


MET 


r(mT2-500) 


0.27 0.08 


6.68 


r(MET420) 


0.48 0.20 


6.49 


r(MET520) 


0.21 0.07 


5.06 


r(MET320) 


0.78 0.53 


4.29 


r(mT2-500/300) 


0.32 0.12 


4.24 


r(4j)(3j) 


0.36 0.61 


4.04 


r(mT2-400) 


0.63 0.40 


4.00 


r(mT2-300) 


0.85 0.62 


3.55 


r(mT2-500/400) 


0.43 0.19 


3.52 


r(Heml) 


0.79 0.63 


2.59 



TABLE XXIV: Best discriminating ratios in the MET box, 
with separations in units of a, for the comparison of LH2 
vs. CS7, taking LH2 as the "data", assuming an integrated 
luminosity of 100 pb~^. 
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FIG. 27: Breakdown of estimated uncertainties for discrimi- 
nating ratios with 100 pb~^, in the comparison of look-alike 
models LH2 and NM4, with LH2 treated as the "data". 
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FIG. 28: Breakdown of estimated uncertainties for discrimi- 
nating ratios with 100 pb~^, in the comparison of look-alike 
models LH2 and CS7, with LII2 treated as the "data" . 



IX. DISCUSSION AND OUTLOOK 

We have presented a concrete strategy for determining 
the underlying theory model of an early missing energy 
discovery at the LHC. Applying this look-alike analysis to 
a realistic simulation, we were able to distinguish a non- 
SUSY model from its SUSY look-alikes essentially at the 
moment of discovery, with little more than 100 pb~^ of 
integrated luminosity. In 23 of 26 pairwise comparisons, 
mostly SUSY with SUSY, we were able to discriminate 
look-alikes at better than 5a significance with at least one 
robust observable and 1000 pb~^ or less of integrated lu- 
minosity. Even in the three cases with the worst discrim- 
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ination we found strong hints of the key properties of the 
underlying model; these would be confirmed with more 
data and/or by our improving the look-alike analysis. 

One surprise of our study (at least to us) was the sen- 
sitivity and robustness of the ratios based on the strans- 
verse mass 771^2- Keep in mind that we did not apply 
the mT2 distributions to their originally intended use i.e. 
extracting masses from endpoints and kinks, and we ap- 
plied our mT2 ratios to data sets 100 times smaller than 
used in previous studies. Nevertheless we found that the 
mT2 ratios are among our best discriminators. One of the 
most important features of the mT2 ratios is that to first 
approximation they do not depend on the spins of the 
parent particles. Since ratios based on more traditional 
kinematic distributions like Hx and M^g have a large de- 
pendence on the spins of the parent particles, comparing 
171x2 ratios to these ratios is a powerful discriminator for 
spin. 

Our main goal in this study was to develop a look- 
alike analysis for missing energy that can be successfully 
applied to the LHC data in the first year of physics run- 
ning. The crucial properties of such an analysis are re- 
alism, robustness, validation and sensitivity. We briefly 
summarize where we stand with respect to establishing 
these properties. 



Realism 

We have employed state-of-the-art event generation for 
the missing energy signals, but only at leading order in 
each subprocess. In the next phase we include the possi- 
bility of extra high Et jets at the matrix element level. 
We are performing this study within the CMS collabora- 
tion and hence replace our fast simulation with the full 
CMS detector simulation. This also allows us to include 
the Standard Model backgrounds in the full analysis, re- 
placing the background subtraction assumed here. 



Robustness 

Our analysis is already quite robust against disappoint- 
ments in the performance of the LHC detectors during 
the first physics run. We have assumed a minimal palette 
of reconstructed objects and triggers. Because our analy- 
sis uses only ratios of correlated inclusive counts, there is 
a large cancellation of theoretical and experimental un- 
certainties, and we can make simple apples-with-apples 
comparisons between different observables. 

Despite the fact that we are considering small data 
sets, we have not employed any multivariate statistical 
methods. As mentioned earlier such methods are left for 
the era of demonstrated understanding of the correlations 
between observables. By dispensing with these methods 
we lose sensitivity but gain a cleaner more robust analy- 
sis. 



We gain additional robustness from the physics redun- 
dancy built into our choice of correlated observables. For 
example, jet multiplicity is correlated with track counts 
in the cones, and sometimes with the ratios of 771x2 counts 
to total number of events in the box. The hemisphere 
ratios are correlated with the difference counts in the 
cones. Muon counting is correlated with b tagging. The 
four trigger boxes provide us with four complete sets of 
ratios, allowing further comparisons and cross-checks. 

Our main deficiency in robustness is the limited num- 
ber of theory models simulated for this study. In the next 
phase we are including a much larger number and variety 
of models. 

A possible approach to expanding this analysis is to ap- 
ply the idea of OSETs [SB], as a strategy for effectively 
sampling the entire theory space. A disadvantage of this 
approach is that, by definition, we give up our spin sen- 
sitivity and more generally any discrimination based on 
details of the matrix elements. 



Validation 

No studies performed to date of the LHC phenomenol- 
ogy of any BSM theory have adequately validated un- 
certainties. The experimental uncertainties cannot be 
sufficiently validated until we have LHC data. The the- 
oretical uncertainties could be validated sooner, but this 
will require many more detailed studies adhering to at 
least the degree of realism attempted here. 

In the next phase of this analysis (currently being per- 
formed within the CMS collaboration) we are validating 
the Standard Model backgrounds of the inclusive missing 
energy signature using the CMS full simulation frame- 
work. This, among other things, is allowing us to com- 
pute the backgrounds in each of our trigger boxes and 
study the effect of varying the selection criteria, e.g. re- 
laxing or modifying the ILV. 

The full detector simulation itself needs to be validated 
against real LHC data, using these same Standard Model 
processes as benchmarks. Similar comments apply to 
the parton distribution functions. In both cases we need 
to develop a more sophisticated parametrization of the 
uncertainties. 

The event generation chains used in our study, while 
state-of-the-art, have not been adequately validated; we 
have performed several cross-validation checks and ob- 
tained significant discrepancies. This is an important 
task for the entire LHC theory community. 

Sensitivity 

We have demonstrated the importance of being able to 
obtain subsamples of the discovery data set enriched in 
t's and 6's. To do this successfully will require a different 
approach to flavor tagging in the early LHC running, an 
approach that emphasizes robustness and fast validation 
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over efficiency and purity. It seems likely that a dedicated 
effort could achieve results as good as our preliminary 
study, and possibly much better. 

Wc have seen that one of the virtues of the mT2 ratios 
is sensitivity to the number of weakly interacting par- 
ticles in the final state. It is important to study this 
further, along with the potential to extract estimates of 
the masses of parent particles and the LSP from small 
data sets. 



Dark matter at 100 pb"^ 

We have demonstrated a concrete strategic solution 
to the LHC Inverse Problem applicable under realistic 
conditions to early physics running at the LHC. Since the 
missing energy signature is motivated by the existence 
of dark matter, we should also address what might be 
called the Dark Matter Inverse Problem: given a missing 
energy discovery at the LHC, what can we learn about 
dark matter and the cosmological events that produced 
it? This problem has not been addressed at all for the 
100 pb^^ era of LHC running. Given an early missing 
energy discovery at the LHC, this problem will become 
one of the most interesting questions in particle physics, 
especially tied in to results from the ongoing direct and 
indirect dark matter searches. 



LM5 


vs. CS4d [100 pb"^] 




Variable 


LM5 CS4d Separation 


MET 


r(HT900) 


0.38 0.24 


3.59 


r(Meffl400) 


0.32 0.21 


3.15 


r(MET520) 


0.28 0.19 


3.12 


DiJot 


r(DiJet) 


0.22 0.15 


2.37 


r(MET520) 


0.28 0.23 


0.56 


r(Hem2) 


0.27 0.32 


0.51 


TriJct 


r(TriJet) 


0.22 0.17 


1.89 


r(Meffl400) 


0.65 0.59 


0.62 


r(HT900) 


0.73 0.67 


0.59 


Muon20 


r(HT900) 


0.35 0.25 


1.17 


r(mT2-300) 


0.27 0.36 


1.00 


r(mT2-400) 


0.26 0.34 


0.93 



TABLE XXV: Largest separation (in units of a) for the com- 
parison of LM5 vs. CS4d (error on CS4d) assuming an inte- 
grated luminosity of 100 pb^^. 



LM5 vs. 


CS4d [100 pb" 




Variable 


LM5 


CS4d 


Separation 


MET 


r(HT900) 


0.38 


0.24 


3.59 


r(Mefri400) 


0.32 


0.21 


3.15 


r(MET520) 


0.28 


0.19 


3.12 


r(mT2-600/300) 


0.29 


0.17 


2.83 


r(mT2-600/400) 


0.31 


0.18 


2.82 


r(mT2-600/500) 


0.46 


0.30 


2.22 


r(MET420) 


0.49 


0.40 


2.14 


r(mT2-300) 


0.36 


0.45 


1.98 


r(mT2-400) 


0.34 


0.43 


1.87 


r(mT2-600) 


0.11 


0.08 


1.47 



TABLE XXVI: Largest separation (in units of a) for the com- 
parison of LM5 vs. CS4d (error on CS4d) assuming an inte- 
grated luminosity of 100 pb~^. 



LM5 vs. 


CS4d [1000 pb- 




Variable 


LM5 


CS4d 


Separation 


MET 


r(MET520) 


0.28 


0.19 


7.07 


r(HT900) 


0.38 


0.24 


6.41 


r(mT2-60n/4nn) 


0.31 


0.18 


6.10 


Di.k't 


r(DiJet) 


0.22 


0.15 


4.80 


r(mT2-600/300) 


0.29 


0.15 


2.69 


r(mT2-600/400) 


0.30 


0.16 


2.69 


TriJet 


r(TriJet) 


0.22 


0.17 


3.50 


r(MET520) 


0.28 


0.24 


1.31 


r(Meffl400) 


0.65 


0.59 


1.29 


Muon20 


r(mT2-600/300) 


0.23 


0.13 


3.04 


r(mT2-600/400) 


0.25 


0.14 


2.98 


r(HT900) 


0.35 


0.25 


2.63 



TABLE XXVII: Best diseriminating ratios in each trigger box, 
with separations in units of a, for the comparison of LM5 
vs. CS4d, taking LM5 as the "data", assuming an integrated 
luminosity of 1000 pb"'^. 
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LM5 vs. 


CS4d 


1000 pb 


"1 


Variable 


LM5 


CS4d 


Separation 


MET 


r(MET520) 


0.28 


0.19 


7.07 


r(HT900) 


0.38 


0.24 


6.41 


r(mT2-600/400) 


0.31 


0.18 


6.10 


r(mT2-600/300) 


0.29 


0.17 


6.04 


r(Mefn400) 


0.32 


0.21 


5.94 


r(mT2-600/500) 


0.46 


0.30 


4.88 


r(MET420) 


0.49 


0.40 


3.82 


r(mT2-600) 


0.11 


0.08 


3.52 


r(mT2-300) 


0.36 


0.45 


3.12 


r(mT2-400) 


0.34 


0.43 


3.02 



I I Exp. Statistical Error 

I I Exp. Systematic Error 

I I Teo. Statistical Error 

I I Teo. Systematic Error 




o 
o 

CO 
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o 
•a- 
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o 
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TABLE XXVIII: Best discriminating ratios in the MET box, 
with separations in units of a, for the comparison of LM5 
vs. CS4d, taking LM5 as the "data", assuming an integrated 
luminosity of 1000 pb~^. 



LM2p vs. LM5 [100 pb~^] 
Variable LM2p LM5 Separation 
MET 



r(5j)(3j) 


0.33 


0.40 


1.64 


r(mT2-300) 


0.41 


0.34 


1.44 


r(mT2-400) 


0.30 


0.25 


1.34 


r(4j)(3j) 


0.64 


0.69 


1.18 


r(r-tag) 


0.07 


0.05 


1.17 


r(10t-c45) 


0.30 


0.36 


1.14 


r(10t-c30) 


0.16 


0.20 


1.13 


r(10t-c75) 


0.48 


0.53 


1.07 


r(10t-c60) 


0.41 


0.47 


1.07 


r(mT2-500) 


0.16 


0.13 


1.05 



TABLE XXIX: Best discriminating ratios in the MET box, 
with separations in units of a, for the comparison of LM2p 
vs. LM5, taking LM2p as the "data", assuming an integrated 
luminosity of 100 pb~^. 
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APPENDIX A: DETAILED RESULTS FOR 
GROUP 1 

1. LM5 vs CS4d 



This comparison is described in section IVII Al We 
treat model LM5 as the data and CS4d as the the- 
ory model. For 100 pb~^ of integrated luminosity, Ta- 



FIG. 29: Breakdown of estimated uncertainties for discrimi- 
nating ratios with 100 pb~^, in the comparison of look-alike 
models LM2p and LM5, with LM2p treated as the "data". 
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FIG. 30: Pdf spreads for discriminating ratios with 100 pb~ , 
in the comparison of look-alike models LM2p and LM5, with 
LM2p treated as the "data". 



ble I XXVI shows the three best discriminating ratios as 
defined in each of the trigger boxes: MET, DiJet, TriJet 
and Muon20. Table IXXVII shows the ten best discrimi- 
nating ratios in the MET box. The separation in units of 
a is computed from the total estimated theoretical and 
experimental uncertainties as described in section IVTl 

The same information for 1000 pb~^ is shown in 
Tables IXXVIII and IXXVIIII The notation r(DiJet), 
r(TriJet) and r(Muon20) denotes the ratio of the number 
of events in that box with the number of events in the 
MET box. Since (plus or minus about one event in 1000 
pb-i) the DiJet, TriJet and Muon20 boxes are subsam- 
ples of the MET box, these are ratios of inclusive counts. 
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LM2p vs. LM5 MET [100pb] 



CS4d vs. LM8 [100 pb~ 



r(10t-c75) 
r(10t-c60) 
r(10t-c45) 
r(10t-c30) 
r(mT2-500) 
r(mT2-400) 
r(nnT2-300) 
r(T-tag) 
r-(5j)(3j) 
r-(4i)(3i) 



Variable 



CS4d LM8 Separation 



MET 



r(5j)(3j) 


0.40 


0.54 


2.83 


r(mT2-300) 


0.41 


0.32 


2.14 


r(4j)(3j) 


0.70 


0.81 


2.11 


r(10t-c30) 


0.20 


0.28 


1.93 


r(20t-c60) 


0.16 


0.23 


1.78 


r(20t-c45) 


0.08 


0.13 


1.73 


r(10t-c30)(10t-c90) 


0.34 


0.43 


1.72 


r(10t-c45) 


0.36 


0.46 


1.70 


r(30t-c90) 


0.11 


0.17 


1.68 


r(Meffl400) 


0.21 


0.28 


1.65 



10-^ 



10 

Number of o 



FIG. 31: Pulls of the best discriminating ratios with 100 pb~^, 
in the comparison of look-alike models LM2p and LM5, with 
LM2p treated as the "data" . 



LM2p 


vs. LM5 [1000 pb"^] 


Variable 


LM2p LM5 Separation 


MET 



r(r-tag) 


0.07 


0.05 


3.14 


r(5j)(3j) 


0.33 


0.40 


2.77 


r(mT2-400) 


0.30 


0.25 


2.56 


r(mT2-500) 


0.16 


0.13 


2.53 


r(mT2-300) 


0.41 


0.34 


2.27 


r(10t-c30) 


0.16 


0.20 


1.67 


r(MET520) 


0.31 


0.28 


1.67 


r(20t-c45) 


0.07 


0.09 


1.61 


r(10tdiff-c30) 


0.14 


0.16 


1.61 


r(4j)(3j) 


0.64 


0.69 


1.56 



TABLE XXX: Best discriminating ratios in the MET box, 
with separations in units of a, for the comparison of LM2p 
vs. LM5, taking LM2p as the "data" , assuming an integrated 
luminosity of 1000 pb~^. 



TABLE XXXI: Best discriminating ratios in the MET box, 
with separations in units of cr, for the comparison of CS4d 
vs. LM8, taking CS4d as the "data", assuming an integrated 
luminosity of 100 pb~^. 



CS4d vs. LM8 [1000 pb~ 



Variable 



CS4d LM8 Separation 



MET 



r(5j)(3j) 


0.40 


0.54 


4.21 


r(10tdiff-c30) 


0.15 


0.20 


3.63 


r(Hem3) 


0.07 


0.11 


3.58 


r(mT2-300) 


0.41 


0.32 


3.43 


r(mT2-400) 


0.25 


0.20 


3.24 


r(20t-c30) 


0.02 


0.04 


3.09 


r(20tdiff-c45) 


0.06 


0.08 


3.05 


r(20tdiff-c60) 


0.12 


0.16 


3.04 


r(20t-c45) 


0.08 


0.13 


3.03 


r(10t-c30)(10t-c90) 


0.34 


0.43 


2.95 



TABLE XXXII: Best discriminating ratios in the MET box, 
with separations in units of cr, for the comparison of CS4d 
vs. LM8, taking CS4d as the "data", assuming an integrated 
luminosity of 1000 pb~^. 



2. LM2p vs LM5 

This comparison is described in section IVII Bl We 
treat model LM2p as the data and LM5 as the theory 
model. For 100 pb~^, the ten best discriminating ra- 
tios are listed in Table IXXIXI and the pulls are displayed 
in Figure [3TJ While the r tag ratio is not discriminat- 
ing well. Figure [29] shows that this is due entirely to poor 
statistics in this small simulated data sample: the experi- 
mental statistical uncertainty is 30%, compared to exper- 
imental and theory systematics both estimated at around 
5%. The small theory statistical uncertainty shown is the 
error from the finite Monte Carlo statistics in simulating 
the theory model. Table IXXXI shows the improvement 



in the discriminating power of r(r-tag) with 1000 pb^-'^, 
due to the reduction in the experimental statistical un- 
certainty. Note that in this difhcult case no other ratio 
discriminates with better than 3a significance. 

In Figure [51] one also notices large differences in the 
relative size of the theory systematics for the different 
ratios. These are due to large differences in the spread 
of values when we vary the parton distribution functions 
used in the simulation, as shown in Figure [301 Notice 
that the pdf spreads vary from less than 5% for the jet 
multiplicity ratio r(4j)(3j) to greater than 20% for the 
cone track count ratio r(10t-c30). 

This is an important generic feature of our results. We 
find the pdf uncertainties to be process dependent and 
thus model dependent. We find also that the relative pdf 
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FIG. 32: Breakdown of estimated uncertainties for discrimi- 
nating ratios with 100 pb~^, in the comparison of look-alike 
models CS4d and LM8, with CS4d treated as the "data". 
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FIG. 33: Breakdown of estimated uncertainties for discrimi- 
nating ratios with 100 pb~^, in the comparison of look-alike 
models CS4d and LM8, with CS4d treated as the "data". 



CS6 vs. 


LM2p [100 pb 




Variable 


CS6 LM2p 


Separation 


MET 


r(MET420) 


0.22 


0.53 


6.99 


r(MET520) 


0.09 


0.31 


6.10 


r(mT2-500/300) 


0.08 


0.39 


5.14 


DiJet 


r(DiJet) 


0.11 


0.21 


3.04 


r(5j)(3j) 


0.54 


0.32 


3.00 


r(4j)(3j) 


0.83 


0.62 


2.56 


TriJet 


r(MET420) 


0.29 


0.53 


2.96 


r(MET320) 


0.55 


0.77 


2.84 


r(HT900) 


0.57 


0.75 


2.46 


Muon20 


r(&-tag) 


0.74 


0.36 


4.25 


r(Muon20) 


0.06 


0.14 


3.14 


r(MET420) 


0.23 


0.48 


2.55 



TABLE XXXIII: Best discriminating ratios in each trigger 
box, with separations in units of a, for the comparison of CS6 
vs. LM2p, taking CS6 as the "data", assuming an integrated 
luminosity of 100 pb^^. 



CS6 vs. 


LM2p [100 pb" 




Variable 


CS6 


LM2p 


Separation 


MET 


r(MET420) 


0.22 


0.53 


6.99 


r(MET520) 


0.09 


0.31 


6.10 


r(mT2-500/300) 


0.08 


0.39 


5.14 


r(mT2-500/400) 


0.17 


0.54 


5.11 


r(MET320) 


0.54 


0.78 


4.93 


r(HT900) 


0.18 


0.38 


4.80 


r(mT2-500) 


0.03 


0.16 


4.56 


r(Meffl400) 


0.16 


0.32 


3.91 


r(mT2-400/300) 


0.49 


0.73 


3.88 


r(mT2-400/300) 


0.49 


0.73 


3.88 



uncertainties for different ratios in the same model vary 
by factors as large as 4 or 5. 



3. CS4d vs LM8 

This comparison is described in section IVII CI We 
treat model CS4d as the data and LM8 as the theory 
model. Tables IXXXII and IXXXIII show the ten best dis- 
criminating ratios for 100 pb^^ and 1000 pb~^ respec- 
tively. We observe that the best discriminator with 100 
pb~^, the jet multiplicity ratio r(5j)(3j), remains the best 
with 1000 pb~^. However the second and third best dsi- 
criminators with 1000 pb""'^, r(10tdiff-c30) and r(Hem3), 
are not even among the top ten best with 100 pb^^. 



TABLE XXXIV: Best discriminating ratios in the MET box, 
with separations in units of ct, for the comparison of CS6 
vs. LM2p, taking CS6 as the "data", assuming an integrated 
luminosity of 100 pb~^. 



These again are generic features of our results. Fig- 
ures |32] and [33] show that with 100 pb~^ r(5j)(3j) already 
has a statistical uncertainty comparable to the estimated 
systematics, while many other ratios still have large 15 to 
20% statistical uncertainties. This includes r(10tdiff-c30) 
and r(Hem3), ratios that are sensitive to what fraction of 
events have large hemisphere differences in object counts 
or track counts. Thus qualitatively new features of the 
events emerge automatically as good discriminators as 
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^H] Exp. statistical Error 
['"^'"1 Exp. Systematic Error 
Teo. Statistical Error 
Teo. Systematic Error 




FIG. 34: Breakdown of estimated uncertainties for discrimi- 
nating ratios with 100 pb~^, in the comparison of look-ahke 
models CS6 and LM2p, with CS6 treated as the "data". 



see that two i?™'"" ratios and two 771^2 ratios already dis- 
criminate at better than 5(t. 

Figure [M] shows the breakdown of the uncertainites 
for some of the ratios in the this comparison. Observe 
that the kinematic ratio r(Meffl400) has a smaller ex- 
perimental statistical uncertainty than do the 777.^2 ra- 
tios r (mT2-50 0/300) and r(mT2-500/400); nevertheless 
Table IXXXlVl shows that r(Mefri400) is a worse discrim- 
inator. 

This is a generic feature of our results. The distribu- 
tions in Moff are rather broad, whereas the r7iT2 distribu- 
tions are steeply falling as one approaches the endpoint 
region. As seen in Figure [351 when the parent particle 
mass differences are large, 771^2 is an intrinsically good 
discriminator even for quite small data samples within 
very modest resolutions. 

Tables IXXXVl and IXXXVll show the best discriminat- 
ing ratios for the comparison of LM8 ans CS4d to the 
CS6 "data" with 100 pb~^. These results show the im- 
portance of the ratios in the Muon20 box. 
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FIG. 35: Comparison of the mT2 distribution of the CS6 
"data" (solid red line) to that of the theory model LM2p 
(dashed blue line) for 100 pb~^. Here mT2 is computed using 
the LSP mass of the theory model LM2p. 



the integrated luminosity goes up, without changing the 
design of the look-alike analysis. 



CS6 vs 


LM8 [ICQ pb"i 




Variable 


CS6 LM8 Separation 


MET 


r(MET420) 


0.22 0.39 


4.03 


r(Hem2) 


0.19 0.34 


3.60 


r(MET520) 


0.09 0.20 


3.45 


DiJet 


r(DiJet) 


0.11 0.18 


2.21 


r(Hem2) 


0.28 0.35 


0.87 


r(MET320) 


0.59 0.66 


0.86 


TriJet 


r(Meffl400) 0.50 0.64 


1.58 


r(HT900) 


0.57 0.68 


1.35 


r(MET320) 


0.55 0.66 


1.21 


Muon20 


r(&-tag) 


0.74 0.29 


6.45 


r(Muon20) 


0.06 0.24 


5.19 


r(MET420) 


0.23 0.36 


1.78 



TABLE XXXV: Best discriminating ratios in each trigger box, 
with separations in units of a, for the comparison of CS6 
vs. LM8, taking CS6 as the "data", assuming an integrated 
luminosity of 100 pb~^. 



4. CS6 vs LM2p, LM5, LM8 and CS4d 

This comparison is described in section IVII Dl We 
treat model CS6 as the data and compare to theory mod- 
els LM2 p, LM5, LM8 and CS4d. Tables IXXXIIII and 
IXXXIVI show the best discriminating ratios for the com- 
parison of LM2p to the CS6 "data" with 100 pb'^ We 



APPENDIX B: DETAILED RESULTS FOR 
GROUP 2 

1. LH2 vs NM4 and CS7 

This comparison is described in section IVIIII We treat 
the little Higgs model LH2 as the data and compare to 
SUSY models NM4 and CS7. Tables IXXXVIUlXLl show 
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CS6 vs. CS4d [100 pb" 


"1 


LH2 vs. 


NM4 [1000 pb"^] 




Variable CS6 CS4d Separation 


Variable 


LH2 NM4 Separation 


MET 






MET 




r(MET420) 0.22 0.40 


4.25 


r(mT2-500) 


0.16 0.05 


14.11 


r(MET320) 0.54 0.70 


3.26 


r(mT2-400) 


0.44 0.21 


11.13 


r(MET520) 0.09 0.19 


3.15 


r(mT2-500/300) 0.21 0.09 


8.52 


DiJet 






DiJet 




r(5j)(3j) 0.54 0.39 


1.53 


r(mT2-400) 


0.32 0.12 


7.89 


r(DiJet) 0.11 0.15 


1.42 


r(mT2-300) 


0.64 0.32 


7.79 


r(4j)(3j) 0.83 0.69 


1.41 


r(DiJet) 


0.11 0.22 


5.94 


TriJet 






TriJct 




r(MET320) 0.55 0.70 


1.63 


r(mT2-300) 


0.62 0.19 


10.96 


r(MET420) 0.29 0.43 


1.53 


r(mT2-400) 


0.34 0.07 


10.91 


r(HT900) 0.57 0.67 


1.17 


r(TriJet) 


0.06 0.15 


5.94 


Muon20 






Muon20 




r(Muon20) 0.06 0.18 


3.95 


r(mT2-400) 


0.38 0.14 


5.03 


r(MET320) 0.54 0.69 


1.73 


r(mT2-300) 


0.72 0.42 


4.30 


r(MET420) 0.23 0.37 


1.59 


r(Mefn400) 


0.10 0.34 


3.50 



TABLE XXXVI: Best discriminating ratios in each trigger 
box, with separations in units of a, for the comparison of CS6 
vs. CS4d, taking CS6 as the "data", assmning an integrated 
luminosity of 100 pb~^. 



TABLE XXXVII: Best discriminating ratios in each trigger 
box, with separations in units of a, for the comparison of LH2 
vs. NM4, taking LH2 as the "data", assuming an integrated 
luminosity of 1000 pb~^. 



the best discriminating ratios for the comparison of NM4 
and CS7 to the LH2 "data" with 1000 pb^i. 

With the better statistics of 1000 pb"\ Ta- 
ble IXXXVlIIl illustrates even more clearly the conun- 
drum discussed in section IVIIII The mT2 ratio r(mT2- 
500/300) shows unquestionably that the parent particle 
masses in model NM4 are too small to fit the "data". 
The E^^^^ distribution of NM4 also appears to be too 
soft, with 4.3cr significance. However the kinematic dis- 
tributions for NM4 represented by r(Meffl400), r(M1400) 
and r(HT900) are all too hard, with > 5cr significance. 

The other impressive feature of these tables is that 
with 1000 pb~^ we acquire several highly discriminating 
ratios in the DiJet and TriJet boxes. With real data this 
would provide an impressive redundancy of cross-checks, 
still within the original design of our look-alike analysis. 

The large number of independent highly discriminating 
robust ratios seen here provide a powerful tool to resolve 
SUSY look-alikes from non-SUSY look-alikes. 



LH2 vs. NM4 [1000 pb~^] 
Variable LH2 NM4 Separation 



MET 



r(mT2-500) 


0.16 


0.05 


14.11 


r(mT2-400) 


0.44 


0.21 


11.13 


r(mT2-500/300) 


0.21 


0.09 


8.52 


r(Mefn400) 


0.11 


0.25 


7.24 


r(M1400) 


0.07 


0.19 


6.57 


r(mT2-300) 


0.75 


0.54 


6.26 


r(mT2-400/300) 


0.58 


0.40 


5.77 


r(HT900) 


0.13 


0.24 


5.67 


r(M1800) 


0.02 


0.07 


4.82 


r(MET420) 


0.48 


0.37 


4.32 



TABLE XXXVIII: Best discriminating ratios in the MET 
box, with separations in units of a, for the comparison of LH2 
vs. NM4, taking LH2 as the "data", assuming an integrated 
luminosity of 1000 pb~^. 



APPENDIX C: COMPARISON OF SQUARK 
PRODUCTION WITH HEAVY QUARK 
PRODUCTION 

1. smuon production versus muon production 

Let's compare the QED processes e+e^ l^^l^-^ and 
e~^e~ — !■ jj-BP-B- We will use the conventions and notation 
of Peskin and Schroeder (PS) [lOl, and work in the ap- 



proximation that the electron and positron are massless. 
In this notation p and p' denote the incoming 4-momenta 
of the electron and positron, while k and k' denote the 
outgoing 4-momenta of the muons or smuons. The pho- 
ton 4- momentum is denoted hy q = p-\-p' . We will use m 
interchangably to denote the mass of the muon or smuon, 
assuming them (in this pedagogical example) to be de- 
generate. 

The leading order QED matrix element for e+e^ 



43 



LH2 vs. CS7 [1000 pb~^] 
Variable LH2 CS7 Separation 
MET 



r(mT2-500) 0.27 0.08 


18.87 


r(MET420) 0.48 0.20 


16.73 


r(MET520) 0.21 0.07 


14.49 


DiJet 


r(4j)(3j) 0.20 0.67 


7.30 


r(mT2-300) 0.72 0.31 


6.73 


r(mT2-400) 0.53 0.22 


6.26 


TriJct 


r(mT2-500) 0.20 0.04 


8.83 


r(mT2-300) 0.68 0.32 


7.43 


r(mT2-400) 0.53 0.22 


7.18 


Muon20 


r(mT2-300) 0.84 0.35 


1.57 


r(mT2-400) 0.60 0.24 


1.32 



TABLE XXXIX: Best discriminating ratios in each trigger 
box, with separations in units of a, for the comparison of LH2 
vs. CS7, taking LH2 as the "data", assuming an integrated 
luminosity of 1000 pb~^. 



LH2 vs. CS7 [1000 pb^ 



Variable 


LH2 CS7 Separation 


MET 


r(mT2-500) 


0.27 0.08 


18.87 


r(MET420) 


0.48 0.20 


16.73 


r(MET520) 


0.21 0.07 


14.49 


r(mT2-600) 


0.05 0.01 


14.11 


r(mT2-500/300) 


0.32 0.12 


11.17 


r(mT2-500/400) 


0.43 0.19 


9.77 


r(mT2-600/300) 


0.06 0.01 


9.77 


r(mT2-400) 


0.63 0.40 


8.46 


r(MET320) 


0.78 0.53 


8.17 



TABLE XL: Best discriminating ratios in the MET box, with 
separations in units of cr, for the comparison of LH2 vs. CS7, 
taking LH2 as the "data" , assuming an integrated luminosity 
of 1000 pb-^ 



/i^/i IS 



r {p'){~ier)u'(p) [ ) u^{k){-ie^^Y (fc') (CI) 



The corresponding matrix element for e+e 



{p'){-tejnu%p) [^^j He{k, - fcp) (C2) 

In each case, we compute the squared matrix element, 
averaging over the spins of the electrons. For e~^e~ — > 



we also sum over the spins of the muons, thus 

3E«,.'..,.' \Mis,s',r,r')\^ = ^^tr [#7^ ^Y] 

xtr[(K + m)7^(K'~m)7,] , (C3) 

while for e+e^ — )■ flRflR we have 

^tr yrikf. - k'^) ^rik. - ki)] . (C4) 

From now on we will follow the convention of Ellis, Stir- 
ling and Webber (ESW) and use a barred summation 
to denote the average over initial spins and sum over final 
spins (if any). Thus performing the traces (jC3[l becomes 



El-^l' - ^[ip-k){p'-k') + {p-k')ip'-k)+m^{p-p')] (C5) 
while (IC4D becomes 



EI-^P ^ ^[{p-p'){k-k') - {p-kw-k') - {p-k'){p'-k) 

+ {p-k){p'-k) + {p-k'){p'-k') - m^{p-p')\ . (C6) 

The kinematics of both cases are identical; in the center 
of mass frame: 



p-p 



2' 



k-k' 



p-k — p'-k' = -(1 — /3cos 



p-k' 



p'-k 



(C7) 



(1 + /3cos( 



where 9 is the polar angle of the final state muon or 
smuon, and 



Substituting, (|C5|) becomes 

E|A^I' = e4 [2 _ /32(i _ cos20)] , 
and (jC6[) becomes 

^|A1|2 ^ 4/52(1 _C0S2^?). 

Thus the differential cross section for e+e^ - 
leading order in QED is 

d{cos6) 2s ' ^ 



while for e^e 



fj-Rp-R we get 



da 



d{cos0) 4s 



■(3^ (1 - cos^ei) , 



(C8) 
(C9) 

(CIO) 
at 

(Cll) 
(C12) 



agreeing with Farrar and Fayet [66| . 

Since a Dirac muon has two complex scalar superpart- 
ners, (Ir and (11, from now on we will write the combined 
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cross section for e+e~ ij-Rfj-R + P'lP'L, which is just 
twice the expression in (|C12p . Thus we have the total 
cross sections: 



~3r 



(C13) 



In the high energy hmit /3 ^ 1, the leading order muon 
pair cross section is exactly twice the smuon pair cross 
section, as noted for example in [93 |. 



2. qq QQ versus qq qq 

From these formulae it is easy to obtain the lead- 
ing order cross sections for hadroproduction of heavy 
quarks/squarks from light qq initial parton states. First 
we introduce a kinematic notation more suitable for 
hadroproduction. The subprocess Mandelstam invari- 
ants are given by 



-(l-/3cosf 



2m 



m' 

s + t + u . 



-(1 + [icosl 



(C14) 



It is convenient to use the dimensionless variables defined 
by ESW: 



t 



T2 



(C15) 



P = 1 - /3' ; 
1 = T1+T2 

We change variables using 

d(cos6') _ 2 

di ^ l3s ' 



(C16) 



Thus (jClip . the leading order QED differential cross sec- 
tion for e^e^ — + ^'^ fj,~ , becomes 



we replace a by and insert a factor of 2/9 to account 
for the color factor, averaging over initial colors, and sum- 
ming over final colors. To make an apples-with-apples 
comparison we assume that the squarks are of a different 
flavor than the initial state partons; then both leading 
order processes arise from a single s-channel diagram. 

Thus we get the fully differential cross section for qq 
QQ: 



(Pa 



-T#/1(X1)/2(X2) 



dxidx2dt 9s 
and for qq qRqn + qL<lL we have 
d^a ina'^ 



, (C19) 



dxidx2di 9s2 



-flixi)f2{x2) 



(C20) 



where fi{xi) and f2{x2) are the pdfs for the initial state 
quark and antiquark. 

We can compare the heavy quark cross section (jC19p 
to ESW by making the change of variables 



d'^a 



dysdyid'^PT 



X1X2 



n 



d^a 



dxidx2dt 



(C21) 



where 2/3, 2/4 are the lab frame rapidities of the heavy 
quarks, and pt is the transverse momentum. Thus 



dysdy^d'^PT 9s 



-^xif 1X2/2 



,(C22) 



agreeing with eqn. 10.51 of ESW. 

Similarly, we can compare the differential cross section 
for squark production (|C20[) to the literature. The com- 
parison requires a couple of kinematic identities: 



lit — m — — 



1 2 2 P 

1 - Ti - r, - - 



(C23) 



ui - = - [s(s - 4m2) - (m - tf] . (C24) 



Using (|C23|) , we see that the differential cross section for 
squark production (IC20[) agrees with Dawson, Eichten 
and Quigg (DEQ) [9^, and (|C24|) shows that we agree 
with Harrison and Llewellyn Smith ^94i] . 

The total cross sections are obtained by integration: 



da 27ra^ 



dt 



while the e+e fj^Rf^R + fi'Lfj'L cross section is 
da 2-110^ 



dt 



1 ^2 ^2 P 

1 — Tl — To — - 



(C17) 



(C18) 



To convert these formulae into cross sections for the 
leading order QCD subprocesses qq — > QQ (heavy quark 
production) and qq qRqR + qLQL (squark production). 



<^{qQ ^ QQ) = I dxidx2 I dt- 



dxidx2di 



(C25) 



47ra: 



dxidx2 



/1/2 



Wi(1-2ti+2t2 + ^) 



Using r"' 



i(l + /3), rf"" = 1(1-/3), this becomes 



aiqq^QQ) = ^ j dxidx2hh^-^^p^ P (C26) 
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which agrees with the result of Combridge [95| . The anal- 
ogous total cross section for squark production is 



27 „ 
47ra^ 



dxidx2fih 



(s — 4to^) 



13 (C27) 



dxidx2fif2- 0^ 
11 I s 



which agrees with Harrison and Llewellyn Smith [94| . 



3. gg ^ QQ versus gg — > qq 

For gluon fusion, we start with the leading order matrix 
elements as given by ESW and DEQ. At leading order 
there are three diagrams for each process, corresponding 
to the s, t and u channels, and an additional gluon seagull 
diagram for the squark case. In the t and u channels we 
are not only producing particles of different spins but also 
exchanging particles of different spins. For gg — s- QQ we 
have (see Table 10.2 of [U): 



j:\M\' = gt 



S V 6ti T2 



4tiT2 



, (C28) 



while for gg gnqR + (IlQe we have (see eqn. 3.26 of 

My- 



E\M\' 



3s ( 48 



3(-a-t)" 



4m 



(C29) 



Converting to the ESW kinematic variables and expand- 
ing, (IC29P becomes 



(C30) 



6tiT2 32Tir2 



-I- 



This can be refactored into 



j:\m\' = 



(C31) 



\ DTlT 



1 



P 



4tiT2 



Notice that the sum of the squared matrix elements for 
heavy quark and squark production with the same mass 
has a very simple form: 



(C32) 



\ 6tit 



An analogous simplification also occurs in the qq initiated 
production, as is obvious from comparing (|C17|) to (|C18|) . 

It does not appear that these elegant SUSY relations 
have ever been noticed in the literature! This may be 
because these are not, strictly speaking, MSSM relations. 
In the MSSM, electroweak symmetry breaking does not 



occur in the SUSY limit where the soft breaking terms 
are turned off. Since the MSSM fermions are massless in 
the absence of EWSB, there are no MSSM cross section 
relations between degenerate heavy quarks and squarks. 
In the simple processes that we considered above, the 
SUSY limit actually corresponds to some more generic 
vectorlike SUSY theory. 

The corresponding fully differential cross section for 
gg -> QQ is 



dxidx2dt 



^/1(X1)/2(X2) 



(C33) 



/ 1 



\6riT2 8 



r 2 +tI+p- 



4tiT2 



and for gg qRqR + qLqL we have 
d^a 7ra? 



dxidx2di 



flixi)f2{x2) 



(C34) 



1 



1 2 2 



^6tiT2 8 
The total cross sections are 



o'(g.g QQ) = [ dxidx2^^ 
48 ./ s 



4tiT2 



^59/3 



+31(3^ + [32 - I6/32 + (1 - /32)2] In 

/1/2 

s 

1 + P 



<^{99 mm + qmL) = j dxidx2 

-31/33 _ [ig(i _ ^2) _ ^2)2] 



1-/3 



; (C35) 
.(C36) 



4. gq ^ QW versus gq qW 

The process gq — > QW contributes to single top pro- 
duction in the Standard Model via qb — > tW^ . At leading 
order there is both an s-channel and a i-channel diagram. 
This can be compared to the SUSY process gq qW, 
i.e., the associated production of a squark with a Wino. 
Again in the corresponding t-channel diagrams we are 
exchanging particles of different spins (a quark versus a 
squark) . 

This process obviously cares about EWSB and the fact 
that we have chiral fermions rather than vectorlike ones. 
Thus as already explained above we do not expect an 
elegant SUSY limit relating the two processes. Indeed 
one observes intrinsic differences already at the level of 
comparing the W and Wino decay widths into Qq and 
qq, respectively. Assuming that these decays were kine- 
matically allowed, we can extract the leading order ex- 
pressions from eqn. 5.15 of [96| and eqn. B.88a of [97j . 
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in the limit that we neglect the light quark mass: 



nW ^ Qq) 



r{W ^ qq) = 



w 



487r 
487r ' 



'"W 



'■w 



H 2 



;(C37) 
(C38) 



where as before m denotes the heavy quark or squark 
mass. These formulae coincide in the limit m = mw = 
m^^, but this is a kinematic limit, not a SUSY limit. 

The fully differential cross section for gq QW was 
computed at leading order by Halzen and Kim 98] , while 
the leading order cross section for gq — s- qW is given in 
DEQ [9^. We convert these expressions to ESW nota- 
tion, introducing a new dimensionless variable 6: 



(C39) 



with mw replaced by rriy^ in the SUSY case. We also 
replace (|C8p by the definition of (3 appropriate for two 
unequal mass final state particles: 



13. 



1 



(m + 7rLw) 



1 



(m — rawY 



(C40) 



where the integrals were performed using the kinematic 
relations: 



(1 + /3-5) ; 
[1-13-5). 



(C45) 
(C46) 



To compare the total cross sections, it is much simpler 
to consider the special limit m = mw — Jtt,^, i.e., the 
case (5 = 0. Then ((C43)) and ((C44)) reduce to: 



a{gq ^ QW) = 

48 7 ^ s 4 



-/3 + 61n 



1 + /?' 

1-/3 



a{gq qW) 



9w^s 
48 



dx^ dx2 ^^^^ — (3 



(C47) 



(C48) 



with (3 given by (|C8|) . 

Note that the non-SUSY total cross section is much 
larger than the SUSY cross section for partners of the 
same mass. The ratio of the cross sections at fixed s is 
11/2 at threshold (/3 = 0) and rises monotonically from 
there, reaching 9.3 for /3 = 0.9 and diverging logarithmi- 
cally in the limit sjw? 00. 



Thus for gq QW we obtain 



dxidx2di 48s^ 
26 



p + 4:d 

and for gq — > qW we have 



■fi{xi)f2{x2) 
26 + Ti 



1 



AS 



p + AS 

1 - 26{1 - S) pS 
n 



(C41) 



d^a 



dxidx2di 



(C42) 



1 - 2(5 - ri 



2(5(1 -(5) 



Pl_ 
2tI 



The total cross sections are 



a{gq ^ QW) = 4? / dxidx2 



s 4[(1 + (5)2 - /32] 
13 [-1 + (1 - 2l5)/32 + (5(31 + m + 21(5^)] (C43) 



+2 [1 - 2(5(1 - (5)] [3(1 - + (5(2 + 3(5)] In i±|--j 



(j{gq qW) 



9w^s 
48 



dxidx2 



/1/2 



1 

X - 
2 



/3 - 7(5/3 -t- 4(5(1 - (5)ln 



1 + /3-S' 
1-13-5 



(C44) 
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